(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

Met Arg Leu Cys Leu Ile Ile Ile Tyr His Arg Gly Thr Cys Met Gly 
1 5 10 15 

Gly Ile Ser Ile Trp Gln Leu Leu Ile Ile Ala Val Ile Val Val Leu 
20 25 30 

Leu Phe Gly Thr Lys Lys Leu Gly Ser Ile Gly Ser Asp Leu Gly Ala 
35 40 45 



Ser Ile Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro Lys Gln 
50 55 60 

Asp Lys Thr Ser Gln Asp Ala Asp Phe Thr Ala Lys Thr Ile Ala Asp 
65 70 75 80 

Lys Gln Ala Asp Thr Asn Gln Glu Gln Ala Lys Thr Glu Asp Ala Lys 
85 90 95 

Arg His Asp Lys Glu Gln Gly Val Asn Pro Cys Leu Ile Ser Val Leu 
100 105 110 

Ala Asn Leu Leu Leu Val Phe Ile Ile Gly Leu Val Val Leu Gly Pro 
115 120 125 

Gln Arg Leu Pro Val Ala Val Lys Thr Val Ala Gly Trp Ile Arg Ala 
130 135 140 

Leu Arg Ser Leu Ala Thr Thr Val Gln Asn Glu Leu Thr Gln Glu Leu 
145 150 155 160 

Lys Leu Gln Glu Phe Gln Asp Ser Leu Lys Lys Val Glu Lys Ala Ser 
165 170 175 

Leu Thr Asn Leu Thr Pro Glu Leu Lys Ala Ser Met Asp Glu Leu Arg 
180 185 190 

Gln Ala Ala Glu Ser Met Lys Arg Ser Tyr Val Ala Asn Asp Pro Glu 
195 200 205 

Lys Ala Ser Asp Glu Ala His Thr Ile His Asn Pro Val Val Lys Asp 
210 215 220 

Asn Glu Ala Ala His Glu Gly Val Thr Pro Ala Ala Ala Gln Thr Gln 
225 230 235 240 

Ala Ser Ser Pro Glu Gln Lys Pro Glu Thr Thr Pro Glu Pro Val Val 
245 250 255 

Lys Pro Ala Ala Asp Ala Glu Pro Lys Thr Ala Ala Pro Ser Pro Ser 
260 265 270 

Ser Ser Asp Lys Pro 
275 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Lys Lys Ser Ile Phe Arg Ala Lys Phe Phe Leu Phe Tyr Arg 
1 5 10 15 

Thr Glu Phe Ile Met Phe Gly Leu Ser Pro Ala Gln Leu Ile Ile Leu 
20 25 30 

Leu Val Val Ile Leu Leu Ile Phe Gly Thr Lys Lys Leu Arg Asn Ala 
35 40 45 

Gly Ser Asp Leu Gly Ala Ala Val Lys Gly Phe Lys Lys Ala Met Lys 
50 55 60 

Glu Asp Glu Lys Val Lys Asp Ala Glu Phe Lys Ser Ile Asp Asn Glu 
65 70 75 80 

Thr Ala Ser Ala Lys Lys Gly Lys Tyr Lys Arg Glu Arg Asn Arg Leu 
85 90 95 

Asn Pro Cys Leu Ile Leu Val Phe Gln Asn Leu Phe Tyr Xaa Met Val 
100 105 110 

Leu Gly Leu Val Val Leu Gly Pro Lys Arg Leu Pro Ile Ala Ile Arg 
115 120 125 

Thr Val Met Asp Trp Val Lys Thr Ile Arg Gly Leu Ala Ala Asn Val 
130 135 140 

Gln Asn Glu Leu Lys Gln Glu Leu Lys Leu Gln Glu Leu Gln Asp Ser 
145 150 155 160 

Ile Lys Lys Ala Glu Ser Leu Asn Leu Gln Ala Leu Ser Pro Glu Leu 
165 170 175 

Ser Lys Thr Val Glu Glu Leu Lys Ala Gln Ala Asp Lys Met Lys Ala 
180 185 190 

Glu Leu Glu Asp Lys Ala Ala Gln Ala Gly Thr Thr Val Glu Asp Gln 
195 200 205 

Ile Lys Glu Ile Lys Ser Ala Ala Glu Asn Ala Glu Lys Ser Gln Asn 
210 215 220 

Ala Ile Ser Val Glu Glu Ala Ala Glu Thr Leu Ser Glu Ala Glu Arg 
225 230 235 240 

Thr Pro Thr Asp Leu Thr Ala Leu Glu Thr His Glu Lys Val Glu Leu 
245 250 255 

Asn Thr His Leu Ser Ser Tyr Tyr Pro Pro Asp Asp Ile Glu Ile Ala 
260 265 270 

Pro Ala Ser Lys Ser Gln Ser Ser Lys Thr Lys Ser 
275 280 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGTCCTGCAG AATGAAGGGT GATTTATGTG ATTTGCATCA CTTTTGGTGG GTAAATTTAT 60 
GCAACGCATT 


TGCGTCATGG 


TGATGAGTAT 


CACGAAAAAA 


TGTTAAACCC 


TTCGGTAAAG 


120 


TGTCTTTTTG 


CTTCTTCTGA 


CTAAACCGAT 


TCACAGAGGA 


GTTGTATATG 


TCCAAGTCTG 


180 


ATGTTTTTCA 


TCTCGGCCTC 


ACTAAAAACG 


ATTTACAAGG 


GGCTACGCTT 


GCCATCGTCC 


240 


CTGGCGACCC 


GGATCGTGTG 


GAAAAGATCG 


CCGCGCTGAT 


GGATAAGCCG 


GTTAAGCTGG 


300 


CATCTCACCG 


CGAATTCACT 


ACCTGGCGTG 


CAGAGCTGGA 


TGGTAAACCT 


GTTATCGTCT 


360 


GCTCTACCGG 


TATCGGCGGC 


CCGTCTACCT 


CTATTGCTGT 


TGAAGAGCTG 


GCACAGCTGG 


420 


GCATTCGCAC 


CTTCCTGCGT 


ATCGGTACAA 


CGGGCGCTAT 


TCAGCCGCAT 


ATTAATGTGG 


480 


GTGATGTCCT 


GGTTACCACG 


GCGTCTGTCC 


GTCTGGATGG 


CGCGAGCCTG 


CACTTCGCAC 


540 


CGCTGGAATT 


CCCGGCTGTC 


GCTGATTTCG 


AATGTACGAC 


TGCGCTGGTT 


GAAGCTGCGA 


600 


AATCCATTGG 


CGCGACAACT 


CACGTTGGCG 


TGACAGCTTC 


TTCTGATACC 


TTCTACCCAG 


660 


GTCAGGAACG 


TTACGATACT 


TACTCTGGTC 


GCGTAGTTCG 


TCACTTTAAA 


GGTTCTATGG 


720 


AAGAGTGGCA 


GGCGATGGGC 


GTAATGAACT 


ATGAAATGGA 


ATCTGCAACC 


CTGCTGACCA 


780 


TGTGTGCAAG 


TCAGGGCCTG 


CGTGCCGGTA 


TGGTAGCGGG 


TGTTATCGTT 


AACCGCACCC 


840 


AGCAAGAGAT 


CCCGAATGCT 


GAGACGATGA 


AACAAACCGA 


AAGCCATGCG 


GTGAAAATCG 


900 


TGGTGGAAGC 


GGCGCGTCGT 


CTGCTGTAAT 


TCTCTTCTCC 


TGTCTGAAGG 


CCGACGCGTT 


960 


CGGCCTTTTG 


TATTTTTGCG 


TAGCGCCTCG 


CAGGAAATGC 


CTTTCCAACT 


GGACGTTTGT 


1020 


ACAGCACAAT 


TCTATTTTGT 


GCGGGTAAGT 


TGTTGCGTCA 


GGAGGCGTTG 


TGGATTTCTC 


1080 


AATCATGGTT 


TACGCAGTTA 


TTGCGTTGGT 


GGGTGTGGCA 


ATTGGCTGGC 


TGTTTGCCAG 


1140 


TTATCAACAT 


GCGCAGCAAA 


AAGCCGAGCA 


ATTAGCTGAA 


CGTGAAGAGA 


TGGTCGCGGA 


1200 


GTTAAGCGCG 


GCAAAACAAC 


AAATTACCCA 


AAGCGAGCAC 


TGGCGTGCAG 


AGTGCGAGTT 


1260 


ACTCAATAAC 


GAAGTGCGCA 


GCCTGCAAAG 


TATTAACACC 


TCTCTGGAGG 


CCGATCTGCG 


1320 


TGAAGTAACC 


ACGCGGATGG 


AAGCCGCACA 


GCAACATGCT 


GACGATAAAA 


TTCGCCAGAT 


1380 


GATTAACAGC 


GAGCAGCGCC 


TCAGTGAGCA 


GTTTGAAAAC 


CTCGCCAACC 


GTATTTTTGA 


1440 


GCACAGCAAT 


CGCCGGGTTG 


ATGAGCAAAA 


CCGTCAGAGT 


CTGAACAGCC 


TGTTGTCGCC 


1500 


GCTACGTGAA 


CAACTGGACG 


GTTTCCGCCG 


TCAGGTTCAG 


GACAGCTTCG 


GTAAAGAAGC 


1560 


ACAAGAACGC 


CATACCCTGA 


CCCACGAAAT 


TCGCAATCTC 


CAGCAACTCA 


ACGCGCAAAT 


1620 


GGCCCAGGAA 


GCGATCAACC 


TGACGCGCGC 


GCTGAAAGGC 


GACAATAAAA 


CCCAGGGCAA 


1680 


CTGGGGCGAG 


GTAGTATTGA 


CGCGGGTGCT 


GGAGGCTTCC 


GGTCTGCGTG 


AAGGGTATGA 


1740 


ATATGAAACC 


CAGGTCAGCA 


TCGAAAATGA 


CGCCCGCTCG 


CGGATGCAGC 


CGGATGTCAT 


1800 


CGTGCGCCTG 


CCGCAGGGAA 


AAGATGTGGT 


GATCGACGCC 


AAAATGACGC 


TGGTCGCCTA 


1860 


TGAACGCTAT 


TTTAACGCCG 


AAGACGACTA 


CACCCGCGAA 


AGCGCGCTAC 


AGGAACATAT 


1920 
CGCGTCGGTG 


CGTAACCATA 


TCCGTTTGCT 


GGGACGCAAA 


GATTATCAAC 


AGCTGCCGGG 


1980 


GCTGCGAACT 


CTGGATTACG 


TGCTGATGTT 


TATTCCCGTT 


GAACCCGCTT 


TTTTACTGGC 


2040 


GCTTGACCGC 


CAGCCGGAGC 


TGATCACCGA 


AGCGTTGAAA 


AACAACATCA 


TGCTGGTTAG 


2100 


CCCGACTACG 


CTGCTGGTGG 


CGCTGCGCAC 


TATCGCCAAC 


CTGTGGCGTT 


ATGAGCATCA 


2160 


AAGCCGCAAC 


GCCCAGCAAA 


TCGCCGATCG 


TGCCAGCAAG 


CTGTACGACA 


AGATGCGTTT 


2220 


GTTCATCGAT 


GACATGTCCG 


CGATTGGTCA 


AAGTCTCGAC 


AAAGCGCAGG 


ATAATTATCG 


2280 


GCAGGCAATG 


AAAAAACTCT 


CTTCAGGGCG 


CGGAAATGTG 


CTGGCGCAGG 


CAGAAGCGTT 


2340 


TCGCGGTTTA 


GGAGTAGAAA 


TTAAACGCGA 


GATTAATCCG 


GATTTGGCTG 


AACAGGCGGT 


2400 


GAGCCAGGAT 


GAAGAGTATC 


GACTTCGGTC 


GGTTCCGGAG 


CAGCCGAATG 


ATGAAGCTTA 


2460 


TCAACGCGAT 


GATGAATATA 


ATCAGCAGTC 


GCGCTAGCCC 


ATTGGGAGTA 


GTTAAGCCGG 


2520 


GTAGAAATCT 


AGGGCATCGA 


CGCCCAATCT 


GTTACACTTC 


TGGAACAATT 


TTTTGATGAG 


2580 


CAGGCATTGA 


GATGGTGGAT 


AAGTCACAAG 


AAACGACGCA 


CTTTGGTTTT 


CAGACCGTCG 


2640 


CGAAGGAACA 


AAAAGCGGAT 


ATGGTCGCCC 


ACGTTTTCCA 


TTCCGTGGCA 


TCAAAATACG 


2700 


ATGTCATGAA 


TGATTTGATG 


TCATTTGGTA 


TTCATCGTTT 


GTGGAAGCGA 


TTCACGATTG 


2760 


ATTGCAGCGG 


CGTACGCCGT 


GGGCAGACCG 


TGCTGGATCT 


GGCTGGTGGC 


ACCGGCGACC 


2820 


TGACAGCGAA 


ATTCTCCCGC 


CTGGTCGGAG 


AAACTGGCAA 


AGTGGTCCTT 


GCTGATATCA 


2880 


ATGAATCCAT 


GCCCAAAATG 


GGCCGCGAGA 


AGCTGCGTAA 


TATCGGTGTG 


ATTGGCAACG 


2940 


TTGAGTATGT 


TCAGGCGAAC 


GCTGAGGCGC 


TGCCGTTCCC 


GGATAACACC 


TTTGATTGCA 


3000 


TCACCATTTC 


GTTTGGTCTG 


CGTAACGTCA 


CCGACAAAGA 


TAAAGCACTG 


CGTTCAATGT 


3060 


ATCGCGTGCT 


GAAACCCGGC 


GGCCGCCTGC 


TGGTGCTTGA 


GTTCTCGAAG 


CCAATTATCG 


3120 


AGCCGCTGAG 


CAAAGCCTAT 


GATGCATACT 


CCTTCCATGT 


GCTGCCGCGT 


ATTGGCTCAC 


3180 


TGGTCGCGAA 


CGACGCCGAC 


AGCTACCGTT 


ATCTGGCAGA 


ATCCATCCGT 


ATGCATCCCG 


3240 


ATCAGGATAC 


CCTGAAAGCC 


ATGATGCAGG 


ATGCCGGATT 


CGAAAGTGTC 


GACTACTACA 


3300 


ATCTGACGGC 


AGGGGTTGTG 


GCGCTGCATC 


GTGGTTATAA 


GTTCTGACAG 


GAGACCGGAA 


3360 


ATGCCTTTTA 


AACCTTTAGT 


GACGGCAGGA 


ATTGAAAGTC 


TGCTCAACAC 


CTTCCTGTAT 


3420 


CGCTCACCCG 


CGCTGAAAAC 


GGCCCGCTCG 


CGTCTGCTGG 


GTAAAGTATT 


GCGCGTGGAG 


3480 


GTAAAAGGCT 


TTTCGACGTC 


ATTGATTCTG 


GTGTTCAGCG 


AACGCCAGGT 


TGATGTACTG 


3540 


GGCGAATGGG 


CAGGCGATGC 


TGACTGCACC 


GTTATCGCCT 


ACGCCAGTGT 


GTTGCCGAAA 


3600 


CTTCGCGATC 


GCCAGCAGCT 


TACCGCACTG 


ATTCGCAGTG 


GTGAGCTGGA 


AGTGCAGGGC 


3660 


GATATTCAGG 


TGGTGCAAAA 


CTTCGTTGCG 


CTGGCAGATC 


TGGCAGAGTT 


CGACCCTGCG 


3720 


GAACTGCTGG 


CCCCTTATAC 


CGGTGATATC 


GCCGCTGAAG 


GAATCAGCAA 


AGCCATGCGC 


3780 
GGAGGCGCAA 


AGTTCCTGCA 


TCACGGCATT 


AAGCGCCAGC 


AACGTTATGT 


GGCGGAAGCC 


3840 


ATTACTGAAG 


AGTGGCGTAT 


GGCACCCGGT 


CCGCTTGAAG 


TGGCCTGGTT 


TGCGGAAGAG 


3900 


ACGGCTGCCG 


TCGAGCGTGC 


TGTTGATGCC 


CTGACCAAAC 


GGCTGGAAAA 


ACTGGAGGCT 


3960 


AAATGACGCC 


AGGTGAAGTA 


CGGCGCCTAT 


ATTTCATCAT 


TCGCACTTTT 


TTAAGCTACG 


4020 


GACTTGATGA 


ACTGATCCCC 


AAAATGCGTA 


TCACCCTGCC 


GCTACGGCTA 


TGGCGATACT 


4080 


CATTATTCTG 


GATGCCAAAT 


CGGCATAAAG 


ACAAACTTTT 


AGGTGAGCGA 


CTACGACTGG 


4140 


CCCTGCAAGA 


ACTGGGGCCG 


GTTTGGATCA 


AGTTCGGGCA 


AATGTTATCA 


ACCCGCCGCG 


4200 


ATCTTTTTCC 


ACCGCATATT 


GCCGATCAGC 


TGGCGTTATT 


GCAGGACAAA 


GTTGCTCCGT 


4260 


TTGATGGCAA 


GCTGGCGAAG 


CAGCAGATTG 


AAGCTGCAAT 


GGGCGGCTTG 


CCGGTAGAAG 


4320 


CGTGGTTTGA 


CGATTTTGAA 


ATCAAGCCGC 


TGGCTTCTGC 


TTCTATCGCC 


CAGGTTCATA 


4380 


CCGCGCGATT 


GAAATCGAAT 


GGTAAAGAGG 


TGGTGATTAA 


AGTCATCCGC 


CCGGATATTT 


4440 


TGCCGGTTAT 


TAAAGCGGAT 


CTGAAACTTA 


TCTACCGTCT 


GGCTCGCTGG 


GTGCCGCGTT 


4500 


TGCTGCCGGA 


TGGTCGCCGT 


CTGCGCCCAA 


CCGAAGTGGT 


GCGCGAGTAC 


GAAAAGACAT 


4560 


TGATTGATGA 


ACTGAATTTG 


CTGCGGGAAT 


CTGCCAACGC 


CATTCAGCTT 


CGGCGCAATT 


4620 


TTGAAGACAG 


CCCGATGCTC 


TACATCCCGG 


AAGTTTACCC 


TGACTATTGT 


AGTGAAGGGA 


4680 


TGATGGTGAT 


GGAGCGCATT 


TACGGCATTC 


CGGTGTCTGA 


TGTTGCGGCG 


CTGGAGAAAA 


4740 


ACGGCACTAA 


CATGAAATTG 


CTGGCGGAAC 


GCGGCGTGCA 


GGTGTTCTTC 


ACTCAGGTCT 


4800 


TTCGCGACAG 


CTTTTTCCAT 


GCCGATATGC 


ACCCTGGCAA 


CATCTTCGTA 


AGCTATGAAC 


4860 


ACCCGGAAAA 


CCCGAAATAT 


ATCGGCATTG 


ATTGCGGGAT 


TGTTGGCTCG 


CTAAACAAAG 


4920 


AAGATAAACG 


CTATCTGGCA 


GAAAACTTTA 


TCGCCTTCTT 


TAATCGCGAC 


TATCGCAAAG 


4980 


TGGCAGAGCT 


ACACGTCGAT 


TCTGGCTGGG 


TGCCACCAGA 


TACCAACGTT 


GAAGAGTTCG 


5040 


AATTTGCCAT 


TCGTACGGTC 


TGTGAACCTA 


TCTTTGAGAA 


ACCGCTGGCC 


GAAATTTCGT 


5100 


TTGGACATGT 


ACTGTTAAAT 


CTGTTTAATA 


CGGCGCGTCG 


CTTCAATATG 


GAAGTGCAGC 


5160 


CGCAACTGGT 


GTTACTCCAG 


AAAACCCTGC 


TCTACGTCGA 


AGGGGTAGGA 


CGCCAGCTTT 


5220 


ATCCGCAACT 


CGATTTATGG 


AAAACGGCGA 


AGCCTTTCCT 


GGAGTCGTGG 


ATTAAAGATC 


5280 


AGGTCGGTAT 


TCCTGCGCTG 


GTGAGAGCAT 


TTAAAGAAAA 


AGCGCCGTTC 


TGGGTCGAAA 


5340 


AAATGCCAGA 


ACTGCCTGAA 


TTGGTTTACG 


ACAGTTTGCG 


CCAGGGCAAG 


TATTTACAGC 


5400 


ACAGTGTTGA 


TAAGATTGCC 


CGCGAGCTTC 


AGTCAAATCA 


TGTACGTCAG 


GGACAATCGC 


5460 


GTTATTTTCT 


CGGAATTGGC 


GCTACGTTAG 


TATTAAGTGG 


CACATTCTTG 


TTGGTCAGCC 


5520 


GACCTGAATG 


GGGGCTGATG 


CCCGGCTGGT 


TAATGGCAGG 


TGGTCTGATC 


GCCTGGTTTG 


5580 


TCGGTTGGCG 


CAAAACACGC 


TGATTTTTTC 


ATCGCTCAAG 


GCGGGCCGTG 


TAACGTATAA 


5640 
TGCGGCTTTG 


TTTAATCATC 


ATCTACCACA 


GAGGAACATG 


TATGGGTGGT 


ATCAGTATTT 


5700 


GGCAGTTATT 


GATTATTGCC 


GTCATCGTTG 


TACTGCTTTT 


TGGCACCAAA 


AAGCTCGGCT 


5760 


CCATCGGTTC 


CGATCTTGGT 


GCGTCGATCA 


AAGGCTTTAA 


AAAAGCAATG 


AGCGATGATG 


5820 


AACCAAAGCA 


GGATAAAACC 


AGTCAGGATG 


CTGATTTTAC 


TGCGAAAACT 


ATCGCCGATA 


5880 


AGCAGGCGGA 


TACGAATCAG 


GAACAGGCTA 


AAACAGAAGA 


CGCGAAGCGC 


CACGATAAAG 


5940 


AGCAGGTGAA 


TCCGTGTTTG 


ATATCGGTTT 


TAGCGAACTT 


GCTATTGGTG 


TTCATCATCG 


6000 


GCCTCGTCGT 


TCTGGGGCCG 


CAACGACTGC 


CTGTGGCGGT 


AAAAACGGTA 


GCGGGCTGGA 


6060 


TTCGCGCGTT 


GCGTTCACTG 


GCGACAACGG 


TGCAGAACGA 


ACTGACCCAG 


GAGTTAAAAC 


6120 


TCCAGGAGTT 


TCAGGACAGT 


CTGAAAAAGG 


TTGAAAAGGC 


GAGCCTCACT 


AACCTGACGC 


6180 


CCGAACTGAA 


AGCGTCGATG 


GATGAACTAC 


GCCAGGCCGC 


GGAGTCGATG 


AAGCGTTCCT 


6240 


ACGTTGCAAA 


CGATCCTGAA 


AAGGCGAGCG 


ATGAAGCGCA 


CACCATCCAT 


AACCCGGTGG 


6300 


TGAAAGATAA 


TGAAGCTGCG 


CATGAGGGCG 


TAACGCCTGC 


CGCTGCACAA 


ACGCAGGCCA 


6360 


GTTCGCCGGA 


ACAGAAGCCA 


GAAACCACGC 


CAGAGCCGGT 


GGTAAAACCT 


GCTGCGGACG 


6420 


CTGAACCGAA 


AACCGCTGCA 


CCTTCCCCTT 


CGTCGAGTGA 


TAAACCGTAA 


ACATGTCTGT 


6480 


AGAAGATACT 


CAACCGCTTA 


TCACGCATCT 


GATTGAGCTG 


CGTAAGCGTC 


TGCTGAACTG 


6540 


CATTATCGCG 


GTGATCGTGA 


TATTCCTGTG 


TCTGGTCTAT 


TTCGCCAATG 


ACATCTATCA 


6600 


CCTGGTATCC 


GCGCCATTGA 


TCAAGCAGTT 


GCCGCAAGGT 


TCAACGATGA 


TCGCCACCGA 


6660 


CGTGGCCTCG 


CCGTTCTTTA 


CGCCGATCAA 


GCTGACCTTT 


ATGGTGTCGC 


TGATTCTGTC 


6720 


AGCGCCGGTG 


ATTCTCTATC 


AGGTGTGGGC 


ATTTATCGCC 


CCAGCGCTGT 


ATAAGCATGA 


6780 


ACGTCGCCTG 


GTGGTGCCGC 


TGCTGGTTTC 


CAGCTCTCTG 


CTGTTTTATA 


TCGGCATGGC 


6840 


ATTCGCCTAC 


TTTGTGGTCT 


TTCCGCTGGC 


ATTTGGCTTC 


CTTGCCAATA 


CCGCGCCGGA 


6900 


AGGGGTGCAG 


GTATCCACCG 


ACATCGCCAG 


CTATTTAAGC 


TTCGTTATGG 


CGCTGTTTAT 


6960 


GGCGTTTGGT 


GTCTCCTTTG 


AAGTGCCGGT 


AGCAATTGTG 


CTGCTGTGCT 


GGATGGGGAT 


7020 


TACCTCGCCA 


GAAGACTTAC 


GCAAAAAACG 


CCCGTATGTG 


CTGGTTGGTG 


CATTCGTTGT 


7080 


CGGGATGTTG 


CTGACGCCGC 


CGGATGTCTT 


CTCGCAAACG 


CTGTTGGCGA 


TCCCGATGTA 


7140 


CTGTCTGTTT 


GAAATCGGTG 


TCTTCTTCTC 


ACGCTTTTAC 


GTTGGTAAAG 


GGCGAAATCG 


7200 


GGAAGAGGAA 


AACGACGCTG 


AAGCAGAAAG 


CGAAAAAACT 


GAAGAATAAA 


TTCAACCGCC 


7260 


CGTCAGGGCG 


GTTGTCATAT 


GGAGTACAGG 


ATGTTTGATA 


TCGGCGTTAA 


TTTGACCAGT 


7320 


TCGCAATTTG 


CGAAAGACCG 


TGATGATGTT 


GTAGCGTGCG 


CTTTTGACGC 


GGGAGTTAAT 


7380 


GGGCTACTCA 


TCACCGGCAC 


TAACCTGCGT 


GAAAGCCAGC 


AGGCGCAAAA 


GCTGGCGCGT 


7440 


CAGTATTCGT 


CCTGTTGGTC 


AACGGCGGGC 


GTACATCCTC 


ACGACAGCAG 


CCAGTGGCAA 


7500 
GCTGCGACTG 


AAGAAGCGAT 


TATTGAGCTG 


GCCGCGCAGC 


CAGAAGTGGT 


GGCGATTGGT 


7560 


GAATGTGGTC 


TCGACTTTAA 


CCGCAACTTT 


TCGACGCCGG 


AAGAGCAGGA 


ACGCGCTTTT 


7620 


GTTGCCCAGC 


TACGCATTGC 


CGCAGATTTA 


AACATGCCGG 


TATTTATGCA 


CTGTCGCGAT 


7680 


GCCCACGAGC 


GGTTTATGAC 


ATTGCTGGAG 


CCGTGGCTGG 


ATAAACTGCC 


TGGTGCGGTT 


7740 


CTTCATTGCT 


TTACCGGCAC 


ACGCGAAGAG 


ATGCAGGCGT 


GCGTGGCGCA 


TGGAATTTAT 


7800 


ATCGGCATTA 


CCGGTTGGGT 


TTGCGATGAA 


CGACGCGGAC 


TGGAGCTGCG 


GGAACTTTTG 


7860 


CCGTTGATTC 


CGGCGGAAAA 


ATTACTGATC 


GAAACTGATG 


CGCCGTATCT 


GCTCCCTCGC 


7920 


GATCTCACGC 


CAAAGCCATC 


ATCCCGGCGC 


AACGAGCCAG 


CCCATCTGCC 


CCATATTTTG 


7980 


CAACGTATTG 


CGCACTGGCG 


TGGAGAAGAT 


GCCGCATGGC 


TGGCTGCCAC 


CACGGATGCT 


8040 


AATGTCAAAA 


CACTGTTTGG 


GATTGCGTTT 


TAGAGTTTGC 


GGAACTCGGT 


ATTCTTCACA 


8100 


CTGTGCTTAA 


TCTCTTTATT 


AATAAGATTA 


AGCAATAGCA 


TGGAGCGAGC 


CTCACCATCG 


8160 


GGTTCGGTGA 


AAATGGCCTG 


AAAGCCTTCG 


AACGCGCCTT 


CGGTAATAAT 


CACCTTATCA 


8220 


CCCGGATAAG 


GGGTTGCCGG 


ATCGACAATG 


TCTTTCGGTT 


TATATACCGA 


TAGCTGATGA 


8280 


ATAACCGCCG 


ATGGGACTAT 


CGCTGGCGAC 


GCGCCAAAGC 


GCACGAAGTG 


GCTGACACCG 


8340 


CGGGTCGCGT 


TGATAGTCGT 


GGTATGAATC 


ACTTCTGGGT 


CAAATTCCAC 


AAACAGGTAG 


8400 


TTGGGGAACA 


ATGGCTCACT 


GACTGCAGTA 


CGTTTTCCAC 


GCACGATTTT 


TTCCAGGGTG 


8460 


ATCATCGGTG 


CCAGGCAATT 


CACAGCCTGT 


CTTTCGAGGT 


GTTCCTGGGC 


ACGTTGAAGT 


8520 


TGCCCGCGCT 


TGCAGTACAG 


TAAATACCAG 


GATTGCATAA 


TGACTCTTAT 


CCGTTTAATC 


8580 


GGGGCGCAAG 


GATAGCAAAA 


GCTTTACGCT 


AAGTTAATTA 


TATTCCCCGG 


TTTGCGTTAT 


8640 


ACCGTCAGAG 


TTCACGCTAA 


TTTAACAAAT 


TTACAGCATC 


GCAAAGATGA 


ACGCCGTATA 


8700 


ATGGGCGCAG 


ATTAAGAGGC 


TACAATGGAC 


GCCATGAAAT 


ATAACGATTT 


ACGCGACTTC 


8760 


TTGACGCTGC 


TTGAACAGCA 


GGGTGAGCTA 


AAACGTATCA 


CGCTCCCGGT 


GGATCCGCAT 


8820 


CTGGAAATCA 


CTGAAATTGC 


TGACCGCACT 


TTGCGTGCCG 


GTGGGCCTGC 


GCTGTTGTTC 


8880 


GAAAACCCTA 


AAGGCTACTC 


AATGCCGGTG 


CTGTGCAACC 


TGTTCGGTAC 


GCCAAAGCGC 


8940 


GTGGCGATGG 


GCATGGGGCA 


GGAAGATGTT 


TCGGCGCTGC 


GTGAAGTTGG 


TAAATTATTG 


9000 


GCGTTTCTGA 


AAGAGCCGGA 


GCCGCCAAAA 


GGTTTCCGCG 


ACCTGTTTGA 


TAAACTGCCG 


9060 


CAGTTTAAGC 


AAGTATTGAA 


CATGCCGACA 


AAGCGGCTGC 


GTGGTGCGCC 


CTGCCAACAA 


9120 


AAAATCGTCT 


CTGGCGATGA 


CGTCGATCTC 


AATCGCATTC 


CCATTATGAC 


CTGCTGGCCG 


9180 


GAAGATGCCG 


CGCCGCTGAT 


TACCTGGGGG 


CTGACAGTGA 


CGCGCGGCCC 


ACATAAAGAG 


9240 


CGGCAGAATC 


TGGGCATTTA 


TCGCCAGCAG 


CTGATTGGTA 


AAAACAAACT 


GATTATGCGC 


9300 


TGGCTGTCGC 


ATCGCGGCGG 


CGCGCTGGAT 


TATCAGGAGT 


GGTGTGCGGC 


GCATCCGGGC 


9360 
GAACGTTTCC 


CGGTTTCTGT 


GGCGCTGGGT 


GCCGATCCCG 


CCACGATTCT 


CGGTGCAGTC 


9420 


ACTCCCGTTC 


CGGATACGCT 


TTCAGAGTAT 


GCGTTTGCCG 


GATTGCTACG 


TGGCACCAAG 


9480 


ACCGAAGTGG 


TGAAGTGTAT 


CTCCAATGAT 


CTTGAAGTGC 


CCGCCAGTGC 


GGAGATTGTG 


9540 


CTGGAAGGGT 


ATATCGAACA 


AGGCGAAACT 


GCGCGGGAAG 


GGCCGTATGG 


CGACCACACC 


9600 


GGTTACTATA 


ATGAAGTCGA 


TAGTTTCCCG 


GTATTTACCG 


TGACGCATAT 


TACCCAGCGT 


9660 


GAAGATGCGA 


TTTACCATTC 


CACCTATACC 


GGGCGTCCGC 


CAGATGAGCC 


CGCGGTGCTG 


9720 


GGTGTCGCAC 


TGAACGAAGT 


GTTTGTGCCG 


ATTCTGCAAA 


AACAGTTCCC 


GGAAATTGTC 


9780 


GATTTTTACC 


TGCCGCCGGA 


AGGCTGCTCT 


TATCGCCTGG 


CGGTAGTGAC 


AATCAAAAAA 


9840 


CAGTACGCCG 


GACACGCGAA 


GCGCGTCATG 


ATGGGCGTCT 


GGTCGTTCTT 


ACGCCAGTTT 


9900 


ATGTACACTA 


AATTTGTGAT 


CGTTTGCGAT 


GATGACGTTA 


ACGCACGCGA 


CTGGAACGAT 


9960 


GTGATTTGGG 


CGATTACCAC 


CCGTATGGAC 


CCGGCGCGGG 


ATACTGTTCT 


GGTAGAAAAT 


10020 


ACGCCTATTG 


ATTATCTGGA 


TTTTGCCTCG 


CCTGTCTCCG 


GGCTGGGTTC 


AAAAATGGGG 


10080 


CTGGATGCCA 


CGAATAAATG 


GCCGGGGGAA 


ACCCAGCGTG 


AATGGGGACG 


TCCCATCAAA 


10140 


AAAGATCCAG 


ATGTTGTCGC 


GCATATTGAC 


GCCATCTGGG 


ATGAACTGGC 


TATTTTTAAC 


10200 


AACGGTAAAA 


GCGCCTGATG 


CGCGTTTGTT 


TTGCCCTATT 


TATCGATCCG 


ACAGAGAAAG 


10260 


CGCATGACAA 


CCTTAAGCTG 


TAAAGTGACC 


TCGGTAGAAG 


CTATCACGGA 


TACCGTATAT 


10320 


CGTGTCCGCA 


TCGTGCCAGA 


CGCGGCCTTT 


TCTTTTCGTG 


CTGGTCAGTA 


TTTGATGGTA 


10380 


GTGATGGATG 


AGCGCGACAA 


ACGTCCGTTC 


TCAATGGCTT 


CGACGCCGGA 


TGAAAAAGGG 


10440 


TTTATCGAGC 


TGCATATTGG 


CGCTTCTGAA 


ATCAACCTTT 


ACGCGAAAGC 


AGTCATGGAC 


10500 


CGCATCCTCA 


AAGATCATCA 


AATCGTGGTC 


GACATTCCCC 


ACGGAGAAGC 


GTGGCTGCGC 


10560 


GATGATGAAG 


AGCGTCCGAT 


GATTTTGATT 


GCGGGCGGCA 


CCGGGTTCTC 


TTATGCCCGC 


10620 


TCGATTTTGC 


TGACAGCGTT 


GGCGCGTAAC 


CCAAACCGTG 


ATATCACCAT 


TTACTGGGGC 


10680 


GGGCGTGAAG 


AGCAGCATCT 


GTATGATCTC 


TGCGAGCTTG 


AGGCGCTTTC 


GTTGAAGCAT 


10740 


CCTGGTCTGC 


AAGTGGTGCC 


GGTGGTTGAA 


CAACCGGAAG 


CGGGCTGGCG 


TGGGCGTACT 


10800 


GGCACCGTGT 


TAACGGCGGT 


ATTGCAGGAT 


CACGGTACGC 


TGGCAGAGCA 


TGATATCTAT 


10860 


ATTGCCGGAC 


GTTTTGAGAT 


GGCGAAAATT 


GCCCGCGATC 


TGTTTTGCAG 


TGAGCGTAAT 


10920 


GCGCGGGAAG 


ATCGCCTGTT 


TGGCGATGCG 


TTTGCATTTA 


TCTGAGATAT 


AAAAAAACCC 


10980 


GCCCCTGACA 


GGCGGGAAGA 


ACGGCAACTA 


AACTGTTATT 


CAGTGGCATT 


TAGATCTATG 


11040 


ACGTATCTGG 


CAAAAGTCCT 


GCAGAATGAA 


GGGTGATTTA 


TGTGATTTGC 


ATCACTTTTG 


11100 


GTGGGTAAAT 


TTATGCAACG 


CATTTGCGTC 


ATGGTGATGA 


GTATCACGAA 


AAAATGTTAA 


11160 


ACCCTTCGGT 


AAAGTGTCTT 


TTTGCTTCTT 


CTGACTAAAC 


CGATTCACAG 


AGGAGTTGTA 


11220 
TATGTCCAAG 


TCTGATGTTT 


TTCATCTCGG 


CCTCACTAAA 


AACGATTTAC 


AAGGGGCTAC 


11280 


GCTTGCCATC 


GTCCCTGGCG 


ACCCGGATCG 


TGTGGAAAAG 


ATCGCCGCGC 


TGATGGATAA 


11340 


GCCGGTTAAG 


CTGGCATCTC 


ACCGCGAATT 


CACTACCTGG 


CGTGCAGAGC 


TGGATGGTAA 


11400 


ACCTGTTATC 


GTCTGCTCTA 


CCGGTATCGG 


CGGCCCGTCT 


ACCTCTATTG 


CTGTTGAAGA 


11460 


GCTGGCACAG 


CTGGGCATTC 


GCACCTTCCT 


GCGTATCGGT 


ACAACGGGCG 


CTATTCAGCC 


11520 


GCATATTAAT 


GTGGGTGATG 


TCCTGGTTAC 


CACGGCGTCT 


GTCCGTCTGG 


ATGGCGCGAG 


11580 


CCTGCACTTC 


GCACCGCTGG 


AATTCCCGGC 


TGTCGCTGAT 


TTCGAATGTA 


CGACTGCGCT 


11640 


GGTTGAAGCT 


GCGAAATCCA 


TTGGCGCGAC 


AACTCACGTT 


GGCGTGACAG 


CTTCTTCTGA 


11700 


TACCTTCTAC 


CCAGGTCAGG 


AACGTTACGA 


TACTTACTCT 


GGTCGCGTAG 


TTCGTCACTT 


11760 


TAAAGGTTCT 


ATGGAAGAGT 


GGCAGGCGAT 


GGGCGTAATG 


AACTATGAAA 


TGGAATCTGC 


11820 


AACCCTGCTG 


ACCATGTGTG 


CAAGTCAGGG 


CCTGCGTGCC 


GGTATGGTAG 


CGGGTGTTAT 


11880 


CGTTAACCGC 


ACCCAGCAAG 


AGATCCCGAA 


TGCTGAGACG 


ATGAAACAAA 


CCGAAAGCCA 


11940 


TGCGGTGAAA 


ATCGTGGTGG 


AAGCGGCGCG 


TCGTCTGCTG 


TAATTCTCTT 


CTCCTGTCTG 


12000 


AAGGCCGACG 


CGTTCGGCCT 


TTTGTATTTT 


TGCGTAGCGC 


CTCGCAGGAA 


ATGCCTTTCC 


12060 


AACTGGACGT 


TTGTACAGCA 


CAATTCTATT 


TTGTGCGGGT 


AAGTTGTTGC 


GTCAGGAGGC 


12120 


GTTGTGGATT 


TCTCAATCAT 


GGTTTACGCA 


GTTATTGCGT 


TGGTGGGTGT 


GGCAATTGGC 


12180 


TGGCTGTTTG 


CCAGTTATCA 


ACATGCGCAG 


CAAAAAGCCG 


AGCAATTAGC 


TGAACGTGAA 


12240 


GAGATGGTCG 


CGGAGTTAAG 


CGCGGCAAAA 


CAACAAATTA 


CCCAAAGCGA 


GCACTGGCGT 


12300 


GCAGAGTGCG 


AGTTACTCAA 


TAACGAAGTG 


CGCAGCCTGC 


AAAGTATTAA 


CACCTCTCTG 


12360 


GAGGCCGATC 


TGCGTGAAGT 


AACCACGCGG 


ATGGAAGCCG 


CACAGCAACA 


TGCTGACGAT 


12420 


AAAATTCGCC 


AGATGATTAA 


CAGCGAGCAG 


CGCCTCAGTG 


AGCAGTTTGA 


AAACCTCGCC 


12480 


AACCGTATTT 


TTGAGCACAG 


CAATCGCCGG 


GTTGATGAGC 


AAAACCGTCA 


GAGTCTGAAC 


12540 


AGCCTGTTGT 


CGCCGCTACG 


TGAACAACTG 


GACGGTTTCC 


GCCGTCAGGT 


TCAGGACAGC 


12600 


TTCGGTAAAG 


AAGCACAAGA 


ACGCCATACC 


CTGACCCACG 


AAATTCGCAA 


TCTCCAGCAA 


12660 


CTCAACGCGC 


AAATGGCCCA 


GGAAGCGATC 


AACCTGACGC 


GCGCGCTGAA 


AGGCGACAAT 


12720 


AAAACCCAGG 


GCAACTGGGG 


CGAGGTAGTA 


TTGACGCGGG 


TGCTGGAGGC 


TTCCGGTCTG 


12780 


CGTGAAGGGT 


ATGAATATGA 


AACCCAGGTC 


AGCATCGAAA 


ATGACGCCCG 


CTCGCGGATG 


12840 


CAGCCGGATG 


TCATCGTGCG 


CCTGCCGCAG 


GGAAAAGATG 


TGGTGATCGA 


CGCCAAAATG 


12900 


ACGCTGGTCG 


CCTATGAACG 


CTATTTTAAC 


GCCGAAGACG 


ACTACACCCG 


CGAAAGCGCG 


12960 


CTACAGGAAC 


ATATCGCGTC 


GGTGCGTAAC 


CATATCCGTT 


TGCTGGGACG 


CAAAGATTAT 


13020 


CAACAGCTGC 


CGGGGCTGCG 


AACTCTGGAT 


TACGTGCTGA 


TGTTTATTCC 


CGTTGAACCC 


13080 
GCTTTTTTAC 


TGGCGCTTGA 


CCGCCAGCCG 


GAGCTGATCA 


CCGAAGCGTT 


GAAAAACAAC 


13140 


ATCATGCTGG 


TTAGCCCGAC 


TACGCTGCTG 


GTGGCGCTGC 


GCACTATCGC 


CAACCTGTGG 


13200 


CGTTATGAGC 


ATCAAAGCCG 


CAACGCCCAG 


CAAATCGCCG 


ATCGTGCCAG 


CAAGCTGTAC 


13260 


GACAAGATGC 


GTTTGTTCAT 


CGATGACATG 


TCCGCGATTG 


GTCAAAGTCT 


CGACAAAGCG 


13320 


CAGGATAATT 


ATCGGCAGGC 


AATGAAAAAA 


CTCTCTTCAG 


GGCGCGGAAA 


TGTGCTGGCG 


13380 


CAGGCAGAAG 


CGTTTCGCGG 


TTTAGGAGTA 


GAAATTAAAC 


GCGAGATTAA 


TCCGGATTTG 


13440 


GCTGAACAGG 


CGGTGAGCCA 


GGATGAAGAG 


TATCGACTTC 


GGTCGGTTCC 


GGAGCAGCCG 


13500 


AATGATGAAG 


CTTATCAACG 


CGATGATGAA 


TATAATCAGC 


AGTCGCGCTA 


GCCCATTGGG 


13560 


AGTAGTTAAG 


CCGGGTAGAA 


ATCTAGGGCA 


TCGACGCCCA 


ATCTGTTACA 


CTTCTGGAAC 


13620 


AATTTTTTGA 


TGAGCAGGCA 


TTGAGATGGT 


GGATAAGTCA 


CAAGAAACGA 


CGCACTTTGG 


13680 


TTTTCAGACC 


GTCGCGAAGG 


AACAAAAAGC 


GGATATGGTC 


GCCCACGTTT 


TCCATTCCGT 


13740 


GGCATCAAAA 


TACGATGTCA 


TGAATGATTT 


GATGTCATTT 


GGTATTCATC 


GTTTGTGGAA 


13800 


GCGATTCACG 


ATTGATTGCA 


GCGGCGTACG 


CCGTGGGCAG 


ACCGTGCTGG 


ATCTGGCTGG 


13860 


TGGCACCGGC 


GACCTGACAG 


CGAAATTCTC 


CCGCCTGGTC 


GGAGAAACTG 


GCAAAGTGGT 


13920 


CCTTGCTGAT 


ATCAATGAAT 


CCATGCCCAA 


AATGGGCCGC 


GAGAAGCTGC 


GTAATATCGG 


13980 


TGTGATTGGC 


AACGTTGAGT 


ATGTTCAGGC 


GAACGCTGAG 


GCGCTGCCGT 


TCCCGGATAA 


14040 


CACCTTTGAT 


TGCATCACCA 


TTTCGTTTGG 


TCTGCGTAAC 


GTCACCGACA 


AAGATAAAGC 


14100 


ACTGCGTTCA 


ATGTATCGCG 


TGCTGAAACC 


CGGCGGCCGC 


CTGCTGGTGC 


TTGAGTTCTC 


14160 


GAAGCCAATT 


ATCGAGCCGC 


TGAGCAAAGC 


CTATGATGCA 


TACTCCTTCC 


ATGTGCTGCC 


14220 


GCGTATTGGC 


TCACTGGTCG 


CGAACGACGC 


CGACAGCTAC 


CGTTATCTGG 


CAGAATCCAT 


14280 


CCGTATGCAT 


CCCGATCAGG 


ATACCCTGAA 


AGCCATGATG 


CAGGATGCCG 


GATTCGAAAG 


14340 


TGTCGACTAC 


TACAATCTGA 


CGGCAGGGGT 


TGTGGCGCTG 


CATCGTGGTT 


ATAAGTTCTG 


14400 


ACAGGAGACC 


GGAAATGCCT 


TTTAAACCTT 


TAGTGACGGC 


AGGAATTGAA 


AGTCTGCTCA 


14460 


ACACCTTCCT 


GTATCGCTCA 


CCCGCGCTGA 


AAACGGCCCG 


CTCGCGTCTG 


CTGGGTAAAG 


14520 


TATTGCGCGT 


GGAGGTAAAA 


GGCTTTTCGA 


CGTCATTGAT 


TCTGGTGTTC 


AGCGAACGCC 


14580 


AGGTTGATGT 


ACTGGGCGAA 


TGGGCAGGCG 


ATGCTGACTG 


CACCGTTATC 


GCCTACGCCA 


14640 


GTGTGTTGCC 


GAAACTTCGC 


GATCGCCAGC 


AGCTTACCGC 


ACTGATTCGC 


AGTGGTGAGC 


14700 


TGGAAGTGCA 


GGGCGATATT 


CAGGTGGTGC 


AAAACTTCGT 


TGCGCTGGCA 


GATCTGGCAG 


14760 


AGTTCGACCC 


TGCGGAACTG 


CTGGCCCCTT 


ATACCGGTGA 


TATCGCCGCT 


GAAGGAATCA 


14820 


GCAAAGCCAT 


GCGCGGAGGC 


GCAAAGTTCC 


TGCATCACGG 


CATTAAGCGC 


CAGCAACGTT 


14880 


ATGTGGCGGA 


AGCCATTACT 


GAAGAGTGGC 


GTATGGCACC 


CGGTCCGCTT 


GAAGTGGCCT 


14940 
GGTTTGCGGA 


AGAGACGGCT 


GCCGTCGAGC 


GTGCTGTTGA 


TGCCCTGACC 


AAACGGCTGG 


15000 


AAAAACTGGA 


GGCTAAATGA 


CGCCAGGTGA 


AGTACGGCGC 


CTATATTTCA 


TCATTCGCAC 


15060 


TTTTTTAAGC 


TACGGACTTG 


ATGAACTGAT 


CCCCAAAATG 


CGTATCACCC 


TGGCGCTACG 


15120 


GCTATGGCGA 


TACTCATTAT 


TCTGGATGCC 


AAATCGGCAT 


AAAGACAAAC 


TTTTAGGTGA 


15180 


GCGACTACGA 


CTGGCCCTGC 


AAGAACTGGG 


GCCGGTTTGG 


ATCAAGTTCG 


GGCAAATGTT 


15240 


ATCAACCCGC 


CGCGATCTTT 


TTCCACCGCA 


TATTGCCGAT 


CAGCTGGCGT 


TATTGCAGGA 


15300 


CAAAGTTGCT 


CCGTTTGATG 


GCAAGCTGGC 


GAAGCAGCAG 


ATTGAAGCTG 


CAATGGGCGG 


15360 


CTTGCCGGTA 


GAAGCGTGGT 


TTGACGATTT 


TGAAATCAAG 


CCGCTGGCTT 


CTGCTTCTAT 


15420 


CGCCCAGGTT 


CATACCGCGC 


GATTGAAATC 


GAATGGTAAA 


GAGGTGGTGA 


TTAAAGTCAT 


15480 


CCGCCCGGAT 


ATTTTGCCGG 


TTATTAAAGC 


GGATCTGAAA 


CTTATCTACC 


GTCTGGCTCG 


15540 


CTGGGTGCCG 


CGTTTGCTGC 


CGGATGGTCG 


CCGTCTGCGC 


CCAACCGAAG 


TGGTGCGCGA 


15600 


GTACGAAAAG 


ACATTGATTG 


ATGAACTGAA 


TTTGCTGCGG 


GAATCTGCCA 


ACGCCATTCA 


15660 


GCTTCGGCGC 


AATTTTGAAG 


ACAGCCCGAT 


GCTCTACATC 


CCGGAAGTTT 


ACCCTGACTA 


15720 


TTGTAGTGAA 


GGGATGATGG 


TGATGGAGCG 


CATTTACGGC 


ATTCCGGTGT 


CTGATGTTGC 


15780 


GGCGCTGGAG 


AAAAACGGCA 


CTAACATGAA 


ATTGCTGGCG 


GAACGCGGCG 


TGCAGGTGTT 


15840 


CTTCACTCAG 


GTCTTTCGCG 


ACAGCTTTTT 


CCATGCCGAT 


ATGCACCCTG 


GCAACATCTT 


15900 


CGTAAGCTAT 


GAACACCCGG 


AAAACCCGAA 


ATATATCGGC 


ATTGATTGCG 


GGATTGTTGG 


15960 


CTCGCTAAAC 


AAAGAAGATA 


AACGCTATCT 


GGCAGAAAAC 


TTTATCGCCT 


TCTTTAATCG 


16020 


CGACTATCGC 


AAAGTGGCAG 


AGCTACACGT 


CGATTCTGGC 


TGGGTGCCAC 


CAGATACCAA 


16080 


CGTTGAAGAG 


TTCGAATTTG 


CCATTCGTAC 


GGTCTGTGAA 


CCTATCTTTG 


AGAAACCGCT 


16140 


GGCCGAAATT 


TCGTTTGGAC 


ATGTACTGTT 


AAATCTGTTT 


AATACGGCGC 


GTCGCTTCAA 


16200 


TATGGAAGTG 


CAGCCGCAAC 


TGGTGTTACT 


CCAGAAAACC 


CTGCTCTACG 


TCGAAGGGGT 


16260 


AGGACGCCAG 


CTTTATCCGC 


AACTCGATTT 


ATGGAAAACG 


GCGAAGCCTT 


TCCTGGAGTC 


16320 


GTGGATTAAA 


GATCAGGTCG 


GTATTCCTGC 


GCTGGTGAGA 


GCATTTAAAG 


AAAAAGCGCC 


16380 


GTTCTGGGTC 


GAAAAAATGC 


CAGAACTGCC 


TGAATTGGTT 


TACGACAGTT 


TGCGCCAGGG 


16440 


CAAGTATTTA 


CAGCACAGTG 


TTGATAAGAT 


TGCCCGCGAG 


CTTCAGTCAA 


ATCATGTACG 


16500 


TCAGGGACAA 


TCGCGTTATT 


TTCTCGGAAT 


TGGCGCTACG 


TTAGTATTAA 


GTGGCACATT 


16560 


CTTGTTGGTC 


AGCCGACCTG 


AATGGGGGCT 


GATGCCCGGC 


TGGTTAATGG 


CAGGTGGTCT 


16620 


GATCGCCTGG 


TTTGTCGGTT 


GGCGCAAAAC 


ACGCTGATTT 


TTTCATCGCT 


CAAGGCGGGC 


16680 


CGTGTAACGT 


ATAATGCGGC 


TTTGTTTAAT 


CATCATCTAC 


CACAGAGGAA 


CATGTATGGG 


16740 


TGGTATCAGT 


ATTTGGCAGT 


TATTGATTAT 


TGCCGTCATC 


GTTGTACTGC 


TTTTTGGCAC 


16800 
CAAAAAGCTC 


GGCTCCATCG 


GTTCCGATCT 


TGGTGCGTCG 


ATCAAAGGCT 


TTAAAAAAGC 


16860 


AATGAGCGAT 


GATGAACCAA 


AGCAGGATAA 


AACCAGTCAG 


GATGCTGATT 


TTACTGCGAA 


16920 


AACTATCGCC 


GATAAGCAGG 


CGGATACGAA 


TCAGGAACAG 


GCTAAAACAG 


AAGACGCGAA 


16980 


GCGCCACGAT 


AAAGAGCAGG 


TGAATCCGTG 


TTTGATATCG 


GTTTTAGCGA 


ACTTGCTATT 


17040 


GGTGTTCATC 


ATCGGCCTCG 


TCGTTCTGGG 


GCCGCAACGA 


CTGCCTGTGG 


CGGTAAAAAC 


17100 


GGTAGCGGGC 


TGGATTCGCG 


CGTTGCGTTC 


ACTGGCGACA 


ACGGTGCAGA 


ACGAACTGAC 


17160 


CCAGGAGTTA 


AAACTCCAGG 


AGTTTCAGGA 


CAGTCTGAAA 


AAGGTTGAAA 


AGGCGAGCCT 


17220 


CACTAACCTG 


ACGCCCGAAC 


TGAAAGCGTC 


GATGGATGAA 


CTACGCCAGG 


CCGCGGAGTC 


17280 


GATGAAGCGT 


TCCTACGTTG 


CAAACGATCC 


TGAAAAGGCG 


AGCGATGAAG 


CGCACACCAT 


17340 


CCATAACCCG 


GTGGTGAAAG 


ATAATGAAGC 


TGCGCATGAG 


GGCGTAACGC 


CTGCCGCTGC 


17400 


ACAAACGCAG 


GCCAGTTCGC 


CGGAACAGAA 


GCCAGAAACC 


ACGCCAGAGC 


CGGTGGTAAA 


17460 


ACCTGCTGCG 


GACGCTGAAC 


CGAAAACCGC 


TGCACCTTCC 


CCTTCGTCGA 


GTGATAAACC 


17520 


GTAAACATGT 


CTGTAGAAGA 


TACTCAACCG 


CTTATCACGC 


ATCTGATTGA 


GCTGCGTAAG 


17580 


CGTCTGCTGA 


ACTGCATTAT 


CGCGGTGATC 


GTGATATTCC 


TGTGTCTGGT 


CTATTTCGCC 


17640 


AATGACATCT 


ATCACCTGGT 


ATCCGCGCCA 


TTGATCAAGC 


AGTTGCCGCA 


AGGTTCAACG 


17700 


ATGATCGCCA 


CCGACGTGGC 


CTCGCCGTTC 


TTTACGCCGA 


TCAAGCTGAC 


CTTTATGGTG 


17760 


TCGCTGATTC 


TGTCAGCGCC 


GGTGATTCTC 


TATCAGGTGT 


GGGCATTTAT 


CGCCCCAGCG 


17820 


CTGTATAAGC 


ATGAACGTCG 


CCTGGTGGTG 


CCGCTGCTGG 


TTTCCAGCTC 


TCTGCTGTTT 


17880 


TATATCGGCA 


TGGCATTCGC 


CTACTTTGTG 


GTCTTTCCGC 


TGGCATTTGG 


CTTCCTTGCC 


17940 


AATACCGCGC 


CGGAAGGGGT 


GCAGGTATCC 


ACCGACATCG 


CCAGCTATTT 


AAGCTTCGTT 


18000 


ATGGCGCTGT 


TTATGGCGTT 


TGGTGTCTCC 


TTTGAAGTGC 


CGGTAGCAAT 


TGTGCTGCTG 


18060 


TGCTGGATGG 


GGATTACCTC 


GCCAGAAGAC 


TTACGCAAAA 


AACGCCCGTA 


TGTGCTGGTT 


18120 


GGTGCATTCG 


TTGTCGGGAT 


GTTGCTGACG 


CCGCCGGATG 


TCTTCTCGCA 


AACGCTGTTG 


18180 


GCGATCCCGA 


TGTACTGTCT 


GTTTGAAATC 


GGTGTCTTCT 


TCTCACGCTT 


TTACGTTGGT 


18240 


AAAGGGCGAA 


ATCGGGAAGA 


GGAAAACGAC 


GCTGAAGCAG 


AAAGCGAAAA 


AACTGAAGAA 


18300 


TAAATTCAAC 


CGCCCGTCAG 


GGCGGTTGTC 


ATATGGAGTA 


CAGGATGTTT 


GATATCGGCG 


18360 


TTAATTTGAC 


CAGTTCGCAA 


TTTGCGAAAG 


ACCGTGATGA 


TGTTGTAGCG 


TGCGCTTTTG 


18420 


ACGCGGGAGT 


TAATGGGCTA 


CTCATCACCG 


GCACTAACCT 


GCGTGAAAGC 


CAGCAGGCGC 


18480 


AAAAGCTGGC 


GCGTCAGTAT 


TCGTCCTGTT 


GGTCAACGGC 


GGGCGTACAT 


CCTCACGACA 


18540 


GCAGCCAGTG 


GCAAGCTGCG 


ACTGAAGAAG 


CGATTATTGA 


GCTGGCCGCG 


CAGCCAGAAG 


18600 


TGGTGGCGAT 


TGGTGAATGT 


GGTCTCGACT 


TTAACCGCAA 


CTTTTCGACG 


CCGGAAGAGC 


18660 



AGGAACGCGC 


TTTTGTTGCC 


CAGCTACGCA 


TTGCCGCAGA 


TTTAAACATG 


CCGGTATTTA 


18720 


TGCACTGTCG 


CGATGCCCAC 


GAGCGGTTTA 


TGACATTGCT 


GGAGCCGTGG 


CTGGATAAAC 


18780 


TGCCTGGTGC 


GGTTCTTCAT 


TGCTTTACCG 


GCACACGCGA 


AGAGATGCAG 


GCGTGCGTGG 


18840 


CGCATGGAAT 


TTATATCGGC 


ATTACCGGTT 


GGGTTTGCGA 


TGAACGACGC 


GGACTGGAGC 


18900 


TGCGGGAACT 


TTTGCCGTTG 


ATTCCGGCGG 


AAAAATTACT 


GATCGAAACT 


GATGCGCCGT 


18960 


ATCTGCTCCC 


TCGCGATCTC 


ACGCCAAAGC 


CATCATCCCG 


GCGCAACGAG 


CCAGCCCATC 


19020 


TGCCCCATAT 


TTTGCAACGT 


ATTGCGCACT 


GGCGTGGAGA 


AGATGCCGCA 


TGGCTGGCTG 


19080 


CCACCACGGA 


TGCTAATGTC 


AAAACACTGT 


TTGGGATTGC 


GTTTTAGAGT 


TTGCGGAACT 


19140 


CGGTATTCTT 


CACACTGTGC 


TTAATCTCTT 


TATTAAT7UVG 


ATTAAGCAAT 


AGCATGGAGC 


19200 


GAGCCTCACC 


ATCGGGTTCG 


GTGAAAATGG 


CCTGAAAGCC 


TTCGAACGCG 


CCTTCGGTAA 


19260 


TAATCACCTT 


ATCACCCGGA 


TAAGGGGTTG 


CCGGATCGAC 


AATGTCTTTC 


GGTTTATATA 


19320 


CCGATAGCTG 


ATGAATAACC 


GCCGATGGGA 


CTATCGCTGG 


CGACGCGCCA 


AAGCGCACGA 


19380 


AGTGGCTGAC 


ACCGCGGGTC 


GCGTTGATAG 


TCGTGGTATG 


AATCACTTCT 


GGGTCAAATT 


19440 


CCACAAACAG 


GTAGTTGGGG 


AACAATGGCT 


CACTGACTGC 


AGTACGTTTT 


CCACGCACGA 


19500 


TTTTTTCCAG 


GGTGATCATC 


GGTGCCAGGC 


AATTCACAGC 


CTGTCTTTCG 


AGGTGTTCCT 


19560 


GGGCACGTTG 


AAGTTGCCCG 


CGCTTGCAGT 


ACAGTAAATA 


CCAGGATTGC 


ATAATGACTC 


19620 


TTATCCGTTT 


AATCGGGGCG 


CAAGGATAGC 


AAAAGCTTTA 


CGCTAAGTTA 


ATTATATTCC 


19680 


CCGGTTTGCG 


TTATACCGTC 


AGAGTTCACG 


CTAATTTAAC 


AAATTTACAG 


CATCGCAAAG 


19740 


ATGAACGCCG 


TATAATGGGC 


GCAGATTAAG 


AGGCTACAAT 


GGACGCCATG 


AAATATAACG 


19800 


ATTTACGCGA 


CTTCTTGACG 


CTGCTTGAAC 


AGCAGGGTGA 


GCTAAAACGT 


ATCACGCTCC 


19860 


CGGTGGATCC 


GCATCTGGAA 


ATCACTGAAA 


TTGCTGACCG 


CACTTTGCGT 


GCCGGTGGGC 


19920 


CTGCGCTGTT 


GTTCGAAAAC 


CCTAAAGGCT 


ACTCAATGCC 


GGTGCTGTGC 


AACCTGTTCG 


19980 


GTACGCCAAA 


GCGCGTGGCG 


ATGGGCATGG 


GGCAGGAAGA 


TGTTTCGGCG 


CTGCGTGAAG 


20040 


TTGGTAAATT 


ATTGGCGTTT 


CTGAAAGAGC 


CGGAGCCGCC 


AAAAGGTTTC 


CGCGACCTGT 


20100 


TTGATAAACT 


GCCGCAGTTT 


AAGCAAGTAT 


TGAACATGCC 


GACAAAGCGG 


CTGCGTGGTG 


20160 


CGCCCTGCCA 


ACAAAAAATC 


GTCTCTGGCG 


ATGACGTCGA 


TCTCAATCGC 


ATTCCCATTA 


20220 


TGACCTGCTG 


GCCGGAAGAT 


GCCGCGCCGC 


TGATTACCTG 


GGGGCTGACA 


GTGACGCGCG 


20280 


GCCCACATAA 


AGAGCGGCAG 


AATCTGGGCA 


TTTATCGCCA 


GCAGCTGATT 


GGTAAAAACA 


20340 


AACTGATTAT 


GCGCTGGCTG 


TCGCATCGCG 


GCGGCGCGCT 


GGATTATCAG 


GAGTGGTGTG 


20400 


CGGCGCATCC 


GGGCGAACGT 


TTCCCGGTTT 


CTGTGGCGCT 


GGGTGCCGAT 


CCCGCCACGA 


20460 


TTCTCGGTGC 


AGTCACTCCC 


GTTCCGGATA 


CGCTTTCAGA 


GTATGCGTTT 


GCCGGATTGC 


20520 



TACGTGGCAC 


CAAGACCGAA 


GTGGTGAAGT 


GTATCTCCAA 


TGATCTTGAA 


GTGCCCGCCA 


20580 


GTGCGGAGAT 


TGTGCTGGAA 


GGGTATATCG 


AACAAGGCGA 


AACTGCGCCG 


GAAGGGCCGT 


20640 


ATGGCGACCA 


CACCGGTTAC 


TATAATGAAG 


TCGATAGTTT 


CCCGGTATTT 


ACCGTGACGC 


20700 


ATATTACCCA 


GCGTGAAGAT 


GCGATTTACC 


ATTCCACCTA 


TACCGGGCGT 


CCGCCAGATG 


20760 


AGCCCGCGGT 


GCTGGGTGTC 


GCACTGAACG 


AAGTGTTTGT 


GCCGATTCTG 


CAAAAACAGT 


20820 


TCCCGGAAAT 


TGTCGATTTT 


TACCTGCCGC 


CGGAAGGCTG 


CTCTTATCGC 


CTGGCGGTAG 


20880 


TGACAATCAA 


AAAACAGTAC 


GCCGGACACG 


CGAAGCGCGT 


CATGATGGGC 


GTCTGGTCGT 


20940 


TCTTACGCCA 


GTTTATGTAC 


ACTAAATTTG 


TGATCGTTTG 


CGATGATGAC 


GTTAACGCAC 


21000 


GCGACTGGAA 


CGATGTGATT 


TGGGCGATTA 


CCACCCGTAT 


QGACCCGGCG 


CGGGATACTG 


21060 


TTCTGGTAGA 


AAATACGCCT 


ATTGATTATC 


TGGATTTTGC 


CTCGCCTGTC 


TCCGGGCTGG 


21120 


GTTCAAAAAT 


GGGGCTGGAT 


GCCACGAATA 


AATGGCCGGG 


GGAAACCCAG 


CGTGAATGGG 


21180 


GACGTCCCAT 


CAAAAAAGAT 


CCAGATGTTG 


TCGCGCATAT 


TGACGCCATC 


TGGGATGAAC 


21240 


TGGCTATTTT 


TAACAACGGT 


AAAAGCGCCT 


GATGCGCGTT 


TGTTTTGCCC 


TATTTATCGA 


21300 


TCCGACAGAG 


AAAGCGCATG 


ACAACCTTAA 


GCTGTAAAGT 


GACCTCGGTA 


GAAGCTATCA 


21360 


CGGATACCGT 


ATATCGTGTC 


CGCATCGTGC 


CAGACGCGGC 


CTTTTCTTTT 


CGTGCTGGTC 


21420 


AGTATTTGAT 


GGTAGTGATG 


GATGAGCGCG 


ACAAACGTCC 


GTTCTCAATG 


GCTTCGACGC 


21480 


CGGATGAAAA 


AGGGTTTATC 


GAGCTGCATA 


TTGGCGCTTC 


TGAAATCAAC 


CTTTACGCGA 


21540 


AAGCAGTCAT 


GGACCGCATC 


CTCAAAGATC 


ATCAAATCGT 


GGTCGACATT 


CCCCACGGAG 


21600 


AAGCGTGGCT 


GCGCGATGAT 


GAAGAGCGTC 


CGATGATTTT 


GATTGCGGGC 


GGCACCGGGT 


21660 


TCTCTTATGC 


CCGCTCGATT 


TTGCTGACAG 


CGTTGGCGCG 


TAACCCAAAC 


CGTGATATCA 


21720 


CCATTTACTG 


GGGCGGGCGT 


GAAGAGCAGC 


ATCTGTATGA 


TCTCTGCGAG 


CTTGAGGCGC 


21780 


TTTCGTTGAA 


GCATCCTGGT 


CTGCAAGTGG 


TGCCGGTGGT 


TGAACTU^CCG 


GAAGCGGGCT 


21840 


GGCGTGGGCG 


TACTGGCACC 


GTGTTAACGG 


CGGTATTGCA 


GGATCACGGT 


ACGCTGGCAG 


21900 


AGCATGATAT 


CTATATTGCC 


GGACGTTTTG 


AGATGGCGAA 


AATTGCCCGC 


GATCTGTTTT 


21960 


GCAGTGAGCG 


TAATGCGCGG 


GAAGATCGCC 


TGTTTGGCGA 


TGCGTTTGCA 


TTTATCTGAG 


22020 


ATATAAAAAA 


ACCCGCCCCT 


GACAGGCGGG 


AAGAACGGCA ACTAAACTGT 


TATTCAGTGG 


22080 


CATTTAGATC 


TATGACGTAT 


CTGGCAAA 








22108 


(2) INFORMATION FOR SEQ ID NO: 4: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ATGCGGCTTT GTTTAATCAT CATCTACCAC AGAGGAACAT GTATGGGTGG TATCAGTATT 60 

TGGCAGTTAT TGATTATTGC CGTCATCGTT GTACTGCTTT TTGGCACCAA AAAGCTCGGC 120 

TCCATCGGTT CCGATCTTGG TGCGTCGATC AAAGGCTTTA AAAAAGCAAT GAGCGATGAT 180

GAACCAAAGC AGGATAAAAC CAGTCAGGAT GCTGATTTTA CTGCGAAAAC TATCGCCGAT 240 

AAGCAGGCGG ATACGAATCA GGAACAGGCT AAAACAGAAG ACGCGAAGCG CCACGATAAA 300 

GAGCAGGTGA ATCCGTGTTT GATATCGGTT TTAGCGAACT TGCTATTGGT GTTCATCATC 360 

GGCCTCGTCG TTCTGGGGCC GCAACGACTG CCTGTGGCGG TAAAAACGGT AGCGGGCTGG 420

ATTCGCGCGT TGCGTTCACT GGCGACAACG GTGCAGAACG AACTGACCCA GGAGTTAAAA 480 

CTCCAGGAGT TTCAGGACAG TCTGAAAAAG GTTGAAAAGG CGAGCCTCAC TAACCTGACG 540 

CCCGAACTGA AAGCGTCGAT GGATGAACTA CGCCAGGCCG CGGAGTCGAT GAAGCGTTCC 600 

TACGTTGCAA ACGATCCTGA AAAGGCGAGC GATGAAGCGC ACACCATCCA TAACCCGGTG 660 

GTGAAAGATA ATGAAGCTGC GCATGAGGGC GTAACGCCTG CCGCTGCACA AACGCAGGCC 720 

AGTTCGCCGG AACAGAAGCC AGAAACCACG CCAGAGCCGG TGGTAAAACC TGCTGCGGAC 780 

GCTGAACCGA AAACCGCTGC ACCTTCCCCT TCGTCGAGTG ATAAACCGTA A 831 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATGTCTGTAG AAGATACTCA ACCGCTTATC ACGCATCTGA TTGAGCTGCG TAAGCGTCTG 60 

CTGAACTGCA TTATCGCGGT GATCGTGATA TTCCTGTGTC TGGTCTATTT CGCCAATGAC 120 

ATCTATCACC TGGTATCCGC GCCATTGATC AAGCAGTTGC CGCAAGGTTC AACGATGATC 180 

GCCACCGACG TGGCCTCGCC GTTCTTTACG CCGATCAAGC TGACCTTTAT GGTGTCGCTG 240 

ATTCTGTCAG CGCCGGTGAT TCTCTATCAG GTGTGGGCAT TTATCGCCCC AGCGCTGTAT 300 

AAGCATGAAC GTCGCCTGGT GGTGCCGCTG CTGGTTTCCA GCTCTCTGCT GTTTTATATC 360 

GGCATGGCAT TCGCCTACTT TGTGGTCTTT CCGCTGGCAT TTGGCTTCCT TGCCAATACC 420 

GCGCCGGAAG GGGTGCAGGT ATCCACCGAC ATCGCCAGCT ATTTAAGCTT CGTTATGGCG 480 

CTGTTTATGG CGTTTGGTGT CTCCTTTGAA GTGCCGGTAG CAATTGTGCT GCTGTGCTGG 540

ATGGGGATTA CCTCGCCAGA AGACTTACGC AAAAAACGCC CGTATGTGCT GGTTGGTGCA 600

TTCGTTGTCG GGATGTTGCT GACGCCGCCG GATGTCTTCT CGCAAACGCT GTTGGCGATC 660 

CCGATGTACT GTCTGTTTGA AATCGGTGTC TTCTTCTCAC GCTTTTACGT TGGTAAAGGG 720 

CGAAATCGGG AAGAGGAAAA CGACGCTGAA GCAGAAAGCG AAAAAACTGA AGAATAAA 778 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



ATGGAGTACA GGATGTTTGA TATCGGCGTT AATTTGACCA GTTCGCAATT TGCGAAAGAC 60

CGTGATGATG TTGTAGCGTG CGCTTTTGAC GCGGGAGTTA ATGGGCTACT CATCACCGGC 120

ACTAACCTGC GTGAAAGCCA GCAGGCGCAA AAGCTGGCGC GTCAGTATTC GTCCTGTTGG 180

TCAACGGCGG GCGTACATCC TCACGACAGC AGCCAGTGGC AAGCTGCGAC TGAAGAAGCG 240

ATTATTGAGC TGGCCGCGCA GCCAGAAGTG GTGGCGATTG GTGAATGTGG TCTCGACTTT 300

AACCGCAACT TTTCGACGCC GGAAGAGCAG GAACGCGCTT TTGTTGCCCA GCTACGCATT 360

GCCGCAGATT TAAACATGCC GGTATTTATG CACTGTCGCG ATGCCCACGA GCGGTTTATG 420

ACATTGCTGG AGCCGTGGCT GGATAAACTG CCTGGTGCGG TTCTTCATTG CTTTACCGGC 480

ACACGCGAAG AGATGCAGGC GTGCGTGGCG CATGGAATTT ATATCGGCAT TACCGGTTGG 540

GTTTGCGATG AACGACGCGG ACTGGAGCTG CGGGAACTTT TGCCGTTGAT TCCGGCGGAA 600

AAATTACTGA TCGAAACTGA TGCGCCGTAT CTGCTCCCTC GCGATCTCAC GCCAAAGCCA 660

TCATCCCGGC GCAACGAGCC AGCCCATCTG CCCCATATTT TGCAACGTAT TGCGCACTGG 720

CGTGGAGAAG ATGCCGCATG GCTGGCTGCC ACCACGGATG CTAATGTCAA AACACTGTTT 780

GGGATTGCGT TTTAG 795



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ser Val Glu Asp Thr Gln Pro Leu Ile Thr His Leu Ile Glu Leu
1 5 10 15

Arg Lys Arg Leu Leu Asn Cys Ile Ile Ala Val Ile Val Ile Phe Leu
20 25 30

Cys Leu Val Tyr Phe Ala Asn Asp Ile Tyr His Leu Val Ser Ala Pro
35 40 45

Leu Ile Lys Gln Leu Pro Gln Gly Ser Thr Met Ile Xaa Xaa Asp Val
50 55 60

Ala Ser Pro Phe Phe Thr Pro Ile Lys Leu Thr Phe Met Val Ser Leu
65 70 75 80

Ile Leu Ser Ala Pro Val Ile Leu Tyr Gln Val Trp Ala Phe Ile Ala
85 90 95

Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val
100 105 110

Ser Ser Ser Leu Leu Phe Leu Tyr Arg His Ala Phe Ala Tyr Phe Val
115 120 125

Val Phe Pro Leu Ala Phe Gly Phe Leu Ala Asn Thr Ala Pro Glu Gly
130 135 140

Val Gln Val Ser Thr Asp Ile Ala Ser Tyr Leu Ser Phe Val Met Ala
145 150 155 160

Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro Val Ala Ile Val
165 170 175

Leu Leu Cys Trp Met Gly Ile Thr Ser Pro Glu Asp Leu Arg Lys Lys
180 185 190

Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly Met Leu Leu Thr
195 200 205

Pro Pro Asp Val Phe Ser Gln Thr Leu Leu Ala Ile Pro Met Tyr Cys
210 215 220

Leu Phe Glu Ile Gly Val Phe Phe Ser Arg Phe Tyr Val Gly Lys Gly
225 230 235 240

Arg Asn Arg Glu Glu Glu Asn Asp Ala Glu Ala Glu Ser Glu Lys Thr
245 250 255

Glu Glu



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Glu Tyr Arg Met Phe Asp Ile Gly Val Asn Leu Thr Ser Ser Gln
1 5 10 15

Phe Ala Lys Asp Arg Asp Asp Val Val Ala Cys Ala Phe Asp Ala Gly
20 25 30

Val Asn Gly Leu Leu Ile Thr Gly Thr Asn Leu Arg Glu Ser Gln Gln
35 40 45

Ala Gln Lys Leu Ala Arg Gln Tyr Ser Ser Cys Trp Ser Thr Ala Gly
50 55 60

Val His Pro His Asp Ser Ser Gln Trp Gln Ala Ala Thr Glu Glu Ala
65 70 75 80

Ile Ile Glu Leu Ala Ala Gln Pro Glu Val Val Ala Ile Gly Glu Cys
85 90 95

Gly Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gln Glu Arg
100 105 110

Ala Phe Val Ala Gln Leu Arg Ile Ala Ala Asp Leu Asn Met Pro Val
115 120 125

Phe Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu
130 135 140

Pro Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly
145 150 155 160

Thr Arg Glu Glu Met Gln Ala Cys Val Ala His Gly Ile Tyr Ile Gly
165 170 175

Ile Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu
180 185 190

Leu Leu Pro Leu Ile Pro Ala Glu Lys Leu Leu Ile Glu Thr Asp Ala
195 200 205

Pro Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg
210 215 220

Asn Glu Pro Ala His Leu Pro His Ile Leu Gln Arg Ile Ala His Trp
225 230 235 240

Arg Gly Glu Asp Ala Ala Trp Leu Ala Ala Thr Thr Asp Ala Asn Val
245 250 255

Lys Thr Leu Phe Gly Ile Ala Phe
260

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Thr Pro Thr Ala Asn Leu Leu Leu Pro Ala Pro Pro Phe Val Pro
1 5 10 15

Ile Ser Asp Val Arg Arg Leu Gln Leu Pro Pro Arg Val Arg His Gln
20 25 30

Pro Arg Pro Cys Trp Lys Gly Val Glu Trp Gly Ser Ile Gln Thr Arg
35 40 45

Met Val Ser Ser Phe Val Ala Val Gly Ser Arg Thr Arg Arg Arg Asn
50 55 60

Val Ile Cys Ala Ser Leu Phe Gly Val Gly Ala Pro Glu Ala Leu Val
65 70 75 80

Ile Gly Val Val Ala Leu Leu Val Phe Gly Pro Lys Gly Leu Ala Glu
85 90 95

Val Ala Arg Asn Leu Gly Lys Thr Leu Arg Ala Phe Gln Pro Thr Ile
100 105 110

Arg Glu Leu Gln Asp Val Ser Arg Glu Phe Arg Ser Thr Leu Glu Arg
115 120 125

Glu Ile Gly Ile Asp Glu Val Ser Gln Ser Thr Asn Tyr Arg Pro Thr
130 135 140

Thr Met Asn Asn Asn Gln Gln Pro Ala Ala Asp Pro Asn Val Lys Pro
145 150 155 160

Glu Pro Ala Pro Tyr Thr Ser Glu Glu Leu Met Lys Val Thr Glu Glu
165 170 175

Gln Ile Ala Ala Ser Ala Ala Ala Ala Trp Asn Pro Gln Gln Pro Ala
180 185 190

Thr Ser Gln Gln Gln Glu Glu Ala Pro Thr Thr Pro Arg Ser Glu Asp
195 200 205

Ala Pro Thr Ser Gly Gly Ser Asp Gly Pro Ala Ala Pro Ala Arg Ala
210 215 220

Val Ser Asp Ser Asp Pro Asn Gln Val Asn Lys Ser Gln Lys Ala Glu
225 230 235 240

Gly Glu Arg



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Gly Glu Ile Ser Ile Thr Lys Leu Leu Val Val Ala Ala Leu Val
1 5 10 15

Val Leu Leu Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu
20 25 30

Gly Ala Ala Ile Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala
35 40 45

Ala Ala Lys Lys Gly Ala Asp Val Asp Leu Gln Ala Glu Lys Leu Ser
50 55 60

His Lys Glu
65

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Leu Thr Leu Val Met Gly Ala Ile Ala Ser Pro Trp Val Ser
1 5 10 15

Val Gly Thr Lys Leu Cys Tyr Ser Arg Leu Asn Glu Ser Phe Tyr Pro
20 25 30

Ser Asn Pro Leu Thr Ala Pro Asn Pro Met Asn Ile Phe Gly Ile Gly
35 40 45

Leu Pro Glu Leu Gly Leu Ile Phe Val Ile Ala Leu Leu Val Phe Gly
50 55 60

Pro Lys Lys Leu Pro Glu Val Gly Arg Ser Leu Gly Lys Ala Leu Arg
65 70 75 80

Gly Phe Gln Glu Ala Ser Lys Glu Phe Glu Thr Glu Leu Lys Arg Glu
85 90 95

Ala Gln Asn Leu Glu Lys Ser Val Gln Ile Lys Ala Glu Leu Glu Glu
100 105 110

Ser Lys Thr Pro Glu Ser Ser Ser Ser Ser Glu Lys Ala Ser
115 120 125

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gly Ala Met Ser Pro Trp His Trp Ala Ile Val Ala Leu Val Val
1 5 10 15

Val Ile Leu Phe Gly Ser Lys Lys Leu Pro Asp Ala Ala Arg Gly Leu
20 25 30

Gly Arg Ser Leu Arg Ile Phe Lys Ser Glu Val Lys Glu Met Gln Asn
35 40 45

Asp Asn Ser Thr Pro Ala Pro Thr Ala Gln Ser Ala Pro Pro Pro Gln
50 55 60

Ser Ala Pro Ala Glu Leu Pro Val Ala Asp Thr Thr Thr Ala Pro Val
65 70 75 80

Thr Pro Pro Ala Pro Val Gln Pro Gln Ser Gln His Thr Glu Pro Lys
85 90 95

Ser Ala

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Met Gly Ile Ser Val Trp Gln Leu Leu Ile Ile Leu Leu Ile Val
1 5 10 15

Val Met Leu Phe Gly Thr Lys Arg Leu Arg Gly Leu Gly Ser Asp Leu
20 25 30

Gly Ser Ala Ile Asn Gly Phe Arg Lys Ser Val Ser Asp Gly Glu Thr
35 40 45

Thr Thr Gln Ala Glu Ala Ser Ser Arg Ser
50 55

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gly Ser Leu Ser Pro Trp His Trp Val Val Leu Val Val Val Val
1 5 10 15

Val Leu Leu Phe Gly Ala Lys Lys Leu Pro Asp Ala Ala Arg Ser Leu
20 25 30

Gly Lys Ser Met Arg Ile Phe Lys Ser Glu Leu Arg Glu Met Gln Thr
35 40 45

Glu Asn Gln Ala Gln Ala Ser Ala Leu Glu Thr Pro Met Gln Asn Pro
50 55 60

Thr Val Val Gln Ser Gln Arg Val Val Pro Pro Trp Ser Thr Glu Gln
65 70 75 80

Asp His Thr Glu Ala Arg Pro Ala
85

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Gly Gly Phe Thr Ser Ile Trp His Trp Val Ile Val Leu Leu Val
1 5 10 15

Ile Val Leu Leu Phe Gly Ala Lys Lys Ile Pro Glu Leu Ala Lys Gly
20 25 30

Leu Gly Ser Gly Ile Lys Asn Phe Lys Lys Ala Val Lys Asp Asp Glu
35 40 45

Glu Glu Ala Lys Asn Glu Pro Lys Thr Leu Asp Ala Gln Ala Thr Gln
50 55 60

Thr Lys Val His Glu Ser Ser Glu Ile Lys Ser Lys Gln Glu Ser
65 70 75

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Lys Lys Ser Ile Phe Arg Ala Lys Phe Phe Leu Phe Tyr Arg
1 5 10 15

Thr Glu Phe Ile Met Phe Gly Leu Ser Pro Ala Gln Leu Ile Ile Leu
20 25 30

Leu Val Val Ile Leu Leu Ile Phe Gly Thr Lys Lys Leu Arg Asn Ala
35 40 45

Gly Ser Asp Leu Gly Ala Ala Val Lys Gly Phe Lys Lys Ala Met Lys
50 55 60

Glu Asp Glu Lys Val Lys Asp Ala Glu Phe Lys Ser Ile Asp Asn Glu
65 70 75 80

Thr Ala Ser Ala Lys Lys Gly Lys Tyr Lys Arg Glu Arg Asn Arg Leu
85 90 95

Asn Pro Cys Leu Ile Leu Val Phe Gln Asn Leu Phe Tyr
100 105 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Pro Ile Gly Pro Gly Ser Leu Ala Val Ile Ala Ile Val Ala Leu
1 5 10 15

Ile Ile Phe Gly Pro Lys Lys Leu Pro Glu Leu Gly Lys Ala Ala Gly
20 25 30

Asp Thr Leu Arg Glu Phe Lys Asn Ala Thr Lys Gly Leu Thr Ser Asp
35 40 45

Glu Glu Glu Lys Lys Lys Glu Asp Gln
50 55

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Gly Phe Gly Gly Ile Ser Ile Trp Gln Leu Leu Ile Ile Leu Leu
1 5 10 15

Ile Val Val Met Leu Phe Gly Thr Lys Arg Leu Lys Ser Leu Gly Ser
20 25 30

Asp Leu Gly Asp Ala Ile Lys Gly Phe Arg Lys Ser Met Asp Asn Glu
35 40 45

Glu Asn Lys Ala Pro Pro Val Glu Glu Gln Lys Gly Gln Asp His Arg
50 55 60

Gly Pro Gly Pro Gln Gly Arg Gly Thr Gly Gln Glu Arg Leu Ser Met
65 70 75 80

Phe Asp Ile Gly Phe Ser Glu Leu Leu Leu Val Gly Leu Val Ala Leu
85 90 95

Leu Val Leu Gly Pro Glu Arg Leu Pro Val Ala Ala Arg Met Ala Gly
100 105 110

Leu Trp Ile Gly Arg Leu Lys Arg Ser Phe Asn Thr Leu Lys Thr Glu
115 120 125

Val Glu Arg Glu Ile Gly Ala Asp Glu Ile Arg Arg Gln Leu His Asn
130 135 140

Glu Arg Ile Leu Glu Leu Glu Arg Glu Met Lys Gln Ser Leu Gln Pro
145 150 155 160

Pro Ala Pro Ser Ala Pro Asp Glu Thr Ala Ala Ser Pro Ala Thr Pro
165 170 175

Pro Gln Pro Ala Ser Pro Ala Ala His Ser Asp Lys Thr Pro Ser Pro
180 185 190

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Thr Glu His Leu Glu Glu Leu Arg Gln Arg Thr Val Phe Val Phe Ile
1 5 10 15

Phe Phe Leu Leu Ala Ala Thr Ile Ser Phe Thr Gln Ile Lys Ile Ile
20 25 30

Val Glu Ile Phe Gln Ala Pro Ala Ile Gly Ile Lys Phe Leu Gln Leu
35 40 45

Ala Pro Gly Glu Tyr Phe Phe Ser Ser Ile Lys Ile Ala Ile Tyr Cys
50 55 60

Gly Ile Val Ala Thr Thr Pro Phe Gly Val Tyr Gln Val Ile Leu Tyr
65 70 75 80

Ile Leu Pro Gly Leu Thr Asn Lys Glu Arg Lys Val Ile Leu Pro Ile
85 90 95

Leu Ile Gly Ser Ile Val Leu Phe Ile Val Gly Gly Ile Phe Ala Tyr
100 105 110

Phe Val Leu Ala Pro Ala Ala Leu Asn Phe Leu Ile Ser Tyr Gly Ala
115 120 125

Asp Ile Val Glu Pro Leu Trp Ser Phe Glu Gln Tyr Phe Asp Phe Ile
130 135 140

Leu Leu Leu Leu Phe Ser Thr Gly Leu Ala Phe Glu Ile Pro
145 150 155

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Lys Thr Ile Leu Glu Glu Val Arg Ile Arg Val Phe Trp Ile Leu Ile
1 5 10 15

Cys Phe Ser Phe Thr Trp Phe Thr Cys Tyr Trp Phe Ser Glu Glu Phe
20 25 30

Ile Phe Leu Leu Ala Lys Pro Phe Leu Thr Leu Pro Tyr Leu Asp Ser
35 40 45

Ser Phe Ile Cys Thr Gln Leu Thr Glu Ala Leu Ser Thr Tyr Val Thr
50 55 60

Thr Ser Leu Ile Ser Cys Phe Tyr Phe Leu Phe Pro Phe Leu Ser Tyr
65 70 75 80

Gln Ile Trp Cys Phe Leu Met Pro Ser Cys Tyr Glu Glu Gln Arg Lys
85 90 95

Lys Tyr Asn Lys Leu Phe Tyr Leu Ser Gly Phe Cys Phe Phe Leu Phe
100 105 110

Phe Phe Val Thr Phe Val Trp Ile Val Pro Asn Val Trp His Phe Leu
115 120 125

Tyr Lys Leu Ser Thr Thr Ser Thr Asn Leu Leu Ile Ile Lys Leu Gln
130 135 140

Pro Lys Ile Phe Asp Tyr Ile Met Leu Thr Val Arg Ile Leu Phe Ile
145 150 155 160

Ser Ser Ile Cys Ser Gln Val Pro
165

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Glu Thr Ile Leu Gly Glu Val Arg Ile Arg Ser Val Arg Ile Leu Ile
1 5 10 15

Gly Leu Gly Leu Thr Trp Phe Thr Cys Tyr Trp Phe Pro Glu Glu Leu
20 25 30

Ile Ser Pro Leu Ala Ser Pro Phe Leu Thr Leu Pro Phe Asp Ser Tyr
35 40 45

Phe Val Cys Thr Gln Leu Thr Glu Ala Phe Ser Thr Phe Val Ala Thr
50 55 60

Ser Ser Ile Ala Cys Ser Tyr Phe Val Phe Pro Leu Ile Ser Tyr Gln
65 70 75 80

Ile Trp Cys Phe Leu Ile Pro Ser Cys Tyr Gly Glu Gln Arg Thr Lys
85 90 95

Tyr Asn Arg Phe Leu His Leu Ser Gly Ser Arg Phe Phe Leu Phe Leu
100 105 110

Phe Leu Thr Pro Pro Arg Val Val Pro Asn Val Trp His Phe Pro Tyr
115 120 125

Phe Val Gly Ala Thr Ser Thr Asn Ser Leu Met Ile Lys Leu Gln Pro
130 135 140

Lys Ile Tyr Asp His Ile Met Leu Thr Val Arg Ile Ser Phe Ile Pro
145 150 155 160

Ser Val Cys Ser Gln Val Pro
165

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Leu Thr His Leu Tyr Glu Ile Arg Leu Arg Ile Ile Tyr Leu Leu Tyr
1 5 10 15

Ser Ile Phe Leu Thr Cys Phe Cys Ser Tyr Gln Tyr Lys Glu Glu Ile
20 25 30

Phe Tyr Leu Leu Phe Ile Pro Leu Ser Lys Asn Phe Ile Tyr Thr Asp
35 40 45

Leu Ile Glu Ala Phe Ile Thr Tyr Ile Lys Leu Ser Ile Ile Val Gly
50 55 60

Ile Tyr Leu Ser Tyr Pro Ile Phe Leu Tyr Gln Ile Trp Ser Phe Leu
65 70 75 80

Ile Pro Gly Phe Phe Leu Tyr Glu Lys Lys Leu Phe Arg Leu Leu Cys
85 90 95

Leu Thr Ser Ile Phe Leu Tyr Phe Leu Gly Ser Cys Ile Gly Tyr Tyr
100 105 110

Leu Leu Phe Pro Ile Ala Phe Thr Phe Phe Leu Gly Phe Gln Lys Leu
115 120 125

Gly Lys Asp Gln Leu Phe Thr Ile Glu Leu Gln Ala Lys Ile His Glu
130 135 140

Tyr Leu Ile Leu Asn Thr Lys Leu Ile Phe Ser Leu Ser Ile Cys Phe
145 150 155 160

Gln Leu Pro



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Phe Asp His Leu Asp Glu Leu Arg Thr Arg Ile Phe Leu Ser Leu Gly
1 5 10 15

Ala Val Leu Val Gly Val Val Ala Cys Phe Ile Phe Val Lys Pro Leu
20 25 30

Val Gln Trp Leu Gln Val Pro Ala Gly Thr Val Lys Phe Leu Gln Leu
35 40 45

Ser Pro Gly Glu Phe Phe Phe Val Ser Val Lys Val Ala Gly Tyr Ser
50 55 60

Gly Ile Leu Val Met Ser Pro Phe Ile Leu Tyr Gln Ile Ile Gln Phe
65 70 75 80

Val Leu Pro Gly Leu Thr Arg Arg Glu Arg Arg Leu Leu Gly Pro Val
85 90 95

Val Leu Gly Ser Ser Val Leu Phe Phe Ala Gly Leu Gly Phe Ala Tyr
100 105 110

Tyr Ala Leu Ile Pro Ala Ala Leu Lys Phe Phe Val Ser Tyr Gly Ala
115 120 125

Asp Val Val Glu Gln Leu Trp Ser Ile Asp Lys Tyr Phe Glu Phe Val
130 135 140

Leu Leu Leu Met Phe Ser Thr Gly Leu Ala Phe Gln Ile Pro
145 150 155

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Val Asp His Leu Thr Glu Leu Arg Thr Arg Leu Leu Ile Ser Leu Ala
1 5 10 15

Ala Ile Leu Val Thr Thr Ile Phe Gly Phe Val Trp Tyr Ser His Ser
20 25 30

Ile Phe Gly Leu Asp Ser Leu Gly Glu Trp Leu Arg His Pro Tyr Cys
35 40 45

Ala Leu Pro Gln Ser Ala Arg Ala Asp Ile Ser Ala Asp Gly Glu Cys
50 55 60

Arg Leu Leu Ala Thr Ala Pro Phe Asp Gln Phe Met Leu Arg Leu Lys
65 70 75 80

Val Gly Met Ala Ala Gly Ile Val Leu Ala Cys Pro Val Trp Phe Tyr
85 90 95

Gln Leu Trp Ala Phe Ile Thr Pro Gly Leu Tyr Gln Arg Glu Arg Arg
100 105 110

Phe Ala Val Ala Phe Val Ile Pro Ala Ala Val Leu Phe Val Ala Gly
115 120 125

Ala Val Leu Ala Tyr Leu Val Leu Ser Lys Ala Leu Gly Phe Leu Leu
130 135 140

Thr Val Gly Ser Asp Val Gln Val Thr Ala Leu Ser Gly Asp Arg Tyr
145 150 155 160

Phe Gly Phe Leu Leu Asn Leu Leu Val Val Phe Gly Val Ser Phe Glu
165 170 175

Phe Pro



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

His Leu Gln Glu Leu Arg Lys Arg Leu Met Val Ser Val Gly Thr Ile
1 5 10 15

Leu Val Ala Phe Leu Gly Cys Phe His Phe Trp Lys Ser Ile Phe Glu
20 25 30

Phe Val Lys Asn Ser Tyr Lys Gly Thr Leu Ile Gln Leu Ser Pro Ile
35 40 45

Glu Gly Val Met Val Ala Val Lys Ile Ser Phe Ser Ala Ala Ile Val
50 55 60

Ile Ser Met Pro Ile Ile Phe Trp Gln Leu Trp Leu Phe Ile Ala Pro
65 70 75 80

Gly Leu Tyr Lys Asn Glu Lys Lys Val Ile Leu Pro Phe Val Phe Phe
85 90 95

Gly Ser Gly Met Phe Leu Ile Gly Ala Ala Phe Ser Tyr Tyr Val Val
100 105 110

Phe Pro Phe Ile Ile Glu Tyr Leu Ala Thr Phe Gly Ser Asp Val Phe
115 120 125

Ala Ala Asn Ile Ser Ala Ser Ser Tyr Val Ser Phe Phe Thr Arg Leu
130 135 140

Ile Leu Gly Phe Gly Val Ala Phe Glu Leu Pro
145 150 155

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ile Thr His Leu Val Glu Leu Arg Asn Arg Leu Leu Arg Cys Val Ile
1 5 10 15

Cys Val Val Leu Val Phe Val Ala Leu Val Tyr Phe Ser Asn Asp Ile
20 25 30

Tyr His Phe Val Ala Ala Pro Leu Thr Ala Val Met Pro Lys Gly Ala
35 40 45

Thr Met Ile Ala Thr Asn Ile Gln Thr Pro Phe Phe Thr Pro Ile Lys
50 55 60

Leu Thr Ala Ile Val Ala Ile Phe Ile Ser Val Pro Tyr Leu Leu Tyr
65 70 75 80

Gln Ile Trp Ala Phe Ile Ala Pro Ala Leu Tyr Gln His Glu Lys Arg
85 90 95

Met Ile Tyr Pro Leu Leu Phe Ser Ser Thr Ile Leu Phe Tyr Cys Gly
100 105 110

Val Ala Phe Ala Tyr Tyr Ile Val Phe Pro Leu Val Phe Ser Phe Phe
115 120 125

Thr Gln Thr Ala Pro Glu Gly Val Thr Ile Ala Thr Asp Ile Ser Ser
130 135 140

Tyr Leu Asp Phe Ala Leu Ala Leu Phe Leu Ala Phe Gly Val Cys Phe
145 150 155 160

Glu Val Pro

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Leu Glu His Ile Ala Glu Leu Arg Lys Arg Leu Leu Ile Val Ala Leu
1 5 10 15

Ala Phe Val Val Phe Phe Ile Ala Gly Phe Phe Leu Ala Lys Pro Ile
20 25 30

Ile Val Tyr Leu Gln Glu Thr Asp Glu Ala Lys Gln Leu Thr Leu Asn
35 40 45

Ala Phe Asn Leu Thr Asp Pro Leu Tyr Val Phe Met Gln Phe Ala Phe
50 55 60

Ile Ile Gly Ile Val Leu Thr Ser Pro Val Ile Leu Tyr Gln Leu Trp
65 70 75 80

Ala Phe Val Ser Pro Gly Leu Tyr Glu Lys Glu Arg Lys Val Thr Leu
85 90 95

Ser Tyr Ile Pro Val Ser Ile Leu Leu Phe Leu Ala Gly Leu Ser Phe
100 105 110

Ser Tyr Tyr Ile Leu Phe Pro Phe Val Val Asp Phe Met Lys Arg Ile
115 120 125

Ser Gln Asp Leu Asn Val Asn Gln Val Ile Gly Ile Asn Glu Tyr Phe
130 135 140

His Phe Leu Leu Gln Leu Thr Ile Pro Phe Gly Leu Leu Phe Gln Met
145 150 155 160

Pro



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Val Ala His Leu Thr Glu Leu Arg Ser Arg Leu Leu Arg Ser Val Ala
1 5 10 15

Ala Val Leu Leu Ile Phe Ala Ala Leu Phe Tyr Phe Ala Gln Asp Ile
20 25 30

Tyr Ala Leu Val Ser Ala Pro Leu Arg Ala Tyr Leu Pro Glu Gly Ala
35 40 45

Thr Met Ile Ala Thr Gly Val Ala Ser Pro Phe Leu Ala Pro Phe Lys
50 55 60

Leu Thr Leu Met Ile Ser Leu Phe Leu Ala Met Pro Val Val Leu His
65 70 75 80

Gln Val Trp Gly Phe Ile Ala Pro Gly Leu Tyr Gln His Glu Lys Arg
85 90 95

Ile Ala Met Pro Leu Met Ala Ser Ser Val Leu Leu Phe Tyr Ala Gly
100 105 110

Met Ala Phe Ala Tyr Phe Val Val Phe Pro Ile Met Phe Gly Phe Phe
115 120 125

Ala Ser Val Thr Pro Glu Gly Val Ala Met Met Thr Asp Ile Gly Gln
130 135 140

Tyr Leu Asp Phe Val Leu Thr Leu Phe Phe Ala Phe Gly Val Ala Phe
145 150 155 160

Glu Val Pro



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Ile Ala Leu Ile Val Ile Val Val Ser Ser Leu Phe Phe Thr Phe Gly
1 5 10 15

Ala Asn Ile Val Val Gly Lys Ile Ile Gly Asp Leu Phe Pro Gly Glu
20 25 30

Ala Val Ile Glu Asn Arg Asp Lys Ile Leu Ala Ile Ala Glu Glu Leu
35 40 45

Lys Lys Ile Ala Ser Asp Leu Glu Asn Tyr Ala Tyr His Pro Ser Glu
50 55 60

Ala Asn Arg Ser Ile Ala Phe Ala Ala Ser Lys Ser Leu Val Arg Ile
65 70 75 80

Ala Met Gln Leu Ser Thr Ser Pro Val Leu Leu Thr Pro Leu Glu Gly
85 90 95

Leu Leu Leu Tyr Leu Lys Ile Ser Leu Ala Val Gly Ile Ala Ala Ala
100 105 110

Leu Pro Tyr Ile Phe His Leu Val Leu Thr Ala Leu Arg Glu Arg Gly
115 120 125

Val Ile Thr Phe Ser Phe Arg Lys Thr Ser Ala Phe Lys Tyr Gly Met
130 135 140

Ala Ala Ile Phe Leu Phe Ala Leu Gly Ile Phe Tyr Gly Tyr Asn Met
145 150 155 160

Met Lys Phe Phe Ile Lys Phe Leu Tyr Leu Met Ala Val Ser Gln Gly
165 170 175

Ala Ile Pro Leu Tyr Ser Leu Ser Glu Phe Val Asn Phe Val Ala Leu
180 185 190

Met Leu Val Leu Phe Gly Ile Val Phe Glu Leu Pro
195 200 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Asp Val Glu Asp Leu Arg Arg Leu Ala Ala Glu Glu Gly Val Val Ala
1 5 10 15

Leu Gly Glu Thr Gly Leu Asp Tyr Tyr Tyr Thr Pro Glu Thr Lys Val
20 25 30

Arg Gln Gln Glu Ser Phe Ile His His Ile Gln Ile Gly Arg Glu Leu
35 40 45

Asn Lys Pro Val Ile Val His Thr Arg Asp Ala Arg Ala Asp Thr Leu
50 55 60

Ala Ile Leu Arg Glu Glu Lys Val Thr Asp Cys Gly Gly Val Leu His
65 70 75 80

Cys Phe Thr Glu Asp Arg Glu Thr Ala Gly Lys Leu Leu Asp Leu Gly
85 90 95

Phe Tyr Ile Ser Phe Ser Gly Ile Val Thr Phe Arg Asn Ala Glu Gln
100 105 110 

Leu Arg Asp Ala Ala Arg Tyr Val Pro Leu Asp Arg Leu Leu Val Glu 
115 120 125 

Thr Asp Ser Pro Tyr Leu Ala Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

Ser Leu Glu Gln Leu Gln Gln Ala Leu Glu Arg Arg Pro Ala Lys Val
1 5 10 15

Val Ala Val Gly Glu Ile Gly Leu Asp Leu Phe Gly Asp Asp Pro Gln
20 25 30

Phe Glu Arg Gln Gln Trp Leu Leu Asp Glu Gln Leu Lys Leu Ala Lys
35 40 45

Arg Tyr Asp Leu Pro Val Ile Leu His Ser Arg Arg Thr His Asp Lys
50 55 60

Leu Ala Met His Leu Lys Arg His Asp Leu Pro Arg Thr Gly Val Val
65 70 75 80

His Gly Phe Ser Gly Ser Leu Gln Gln Ala Glu Arg Phe Val Gln Leu
85 90 95

Gly Tyr Lys Ile Gly Val Gly Gly Thr Ile Thr Tyr Pro Arg Ala Ser
100 105 110

Lys Thr Arg Asp Val Ile Ala Lys Leu Pro Leu Ala Ser Leu Leu Leu
115 120 125 

Glu Thr Asp Ala Pro Asp Met Pro Leu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 135 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Leu Ile Gly Glu Val Val Ser Gln Ile Glu Ser Asn Ile Asp Leu Ile
1 5 10 15

Val Ala Val Gly Glu Thr Gly Met Asp Phe His His Thr Arg Asp Glu
20 25 30

Glu Gly Arg Arg Arg Gln Glu Glu Thr Phe Arg Val Phe Val Glu Leu
35 40 45

Ala Ala Glu His Glu Met Pro Leu Val Val His Ala Arg Asp Ala Glu
50 55 60

Glu Arg Ala Leu Glu Thr Val Leu Glu Tyr Arg Val Pro Glu Val Ile
65 70 75 80

Phe His Cys Tyr Gly Gly Ser Ile Glu Thr Ala Arg Arg Ile Leu Asp
85 90 95

Glu Gly Tyr Tyr Ile Ser Ile Ser Thr Leu Val Ala Phe Ser Glu His
100 105 110

His Met Glu Leu Val Arg Ala Ile Pro Leu Glu Gly Met Leu Thr Glu
115 120 125 

Thr Asp Ser Pro Tyr Leu Ser 
130 135

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:

Ala Gln Ala Thr Leu Lys Lys Leu Val Ser Thr His Arg Ser Phe Ile
1 5 10 15

Ser Cys Ile Gly Glu Tyr Gly Phe Asp Tyr His Tyr Thr Lys Asp Tyr
20 25 30

Ile Thr Gln Gln Glu Gln Phe Phe Leu Met Gln Phe Gln Leu Ala Glu
35 40 45

Gln Tyr Gln Leu Val His Met Leu His Val Arg Asp Val His Glu Arg
50 55 60

Ile Tyr Glu Val Leu Lys Arg Leu Lys Pro Lys Gln Pro Val Val Phe
65 70 75 80

His Cys Phe Ser Glu Asp Thr Asn Thr Ala Leu Lys Leu Leu Thr Leu
85 90 95

Arg Glu Val Gly Leu Lys Val Tyr Phe Ser Ile Pro Gly Ile Val Thr
100 105 110

Phe Lys Asn Ala Lys Asn Leu Gln Ala Ala Leu Ser Val Ile Pro Thr
115 120 125 

Glu Leu Leu Leu Ser Glu Thr Asp Ser Pro Tyr Leu Ala Pro 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ala Arg Ala Glu Leu Glu Arg Leu Val Ala His Pro Arg Val Val Ala 
1 5 10 15

Val Gly Glu Thr Gly Ile Asp Met Tyr Trp Pro Gly Arg Leu Asp Gly
20 25 30

Cys Ala Glu Pro His Val Gln Arg Glu Ala Phe Ala Trp His Ile Asp
35 40 45

Leu Ala Lys Arg Thr Gly Lys Pro Leu Met Ile His Asn Arg Gln Ala
50 55 60

Asp Arg Asp Val Leu Asp Val Leu Arg Ala Glu Gly Ala Pro Asp Thr
65 70 75 80

Val Ile Leu His Cys Phe Ser Ser Asp Ala Ala Met Ala Arg Thr Cys
85 90 95

Val Asp Ala Gly Trp Leu Leu Ser Leu Ser Gly Thr Val Ser Phe Arg
100 105 110

Thr Ala Arg Glu Leu Arg Glu Ala Val Pro Leu Met Pro Val Glu Gln
115 120 125

Leu Leu Val Glu Thr Asp Ala Pro Tyr Leu Thr Pro 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Asp Glu Ser Leu Phe Glu Lys Phe Val Gly His Gln Lys Cys Val Ala
1 5 10 15

Ile Gly Glu Cys Gly Leu Asp Tyr Tyr Arg Leu Pro Glu Leu Asn Glu
20 25 30

Arg Glu Asn Tyr Lys Ser Lys Gln Lys Glu Ile Phe Thr Lys Gln Ile
35 40 45

Glu Phe Ser Ile Gln His Asn Lys Pro Leu Ile Ile His Ile Arg Glu
50 55 60

Ala Ser Phe Asp Ser Leu Asn Leu Leu Lys Asn Tyr Pro Lys Ala Phe
65 70 75 80

Gly Val Leu His Cys Phe Asn Ala Asp Gly Met Leu Leu Glu Leu Ser
85 90 95

Asp Arg Phe Tyr Tyr Gly Ile Gly Gly Val Ser Thr Phe Lys Asn Ala
100 105 110

Lys Arg Leu Val Glu Ile Leu Pro Lys Ile Pro Lys Asn Arg Leu Leu
115 120 125 

Leu Glu Thr Asp Ser Pro Tyr Leu Thr Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:

Asp Ala Glu Arg Leu Leu Arg Leu Ala Gln Asp Pro Lys Val Ile Ala
1 5 10 15

Ile Gly Glu Ile Gly Leu Asp Tyr Tyr Tyr Ser Ala Asp Asn Lys Ala
20 25 30

Ala Gln Gln Ala Val Phe Gly Ser Gln Ile Asp Ile Ala Asn Gln Leu
35 40 45

Asp Lys Pro Val Ile Ile His Thr Arg Ser Ala Gly Asp Asp Thr Ile
50 55 60

Ala Met Leu Arg Gln His Arg Ala Glu Lys Cys Gly Gly Val Ile His
65 70 75 80

Cys Phe Thr Glu Thr Met Glu Phe Xaa Lys Lys Ala Leu Asp Leu Gly
85 90 95

Phe Tyr Ile Ser Cys Ser Gly Ile Val Thr Phe Lys Asn Ala Glu Ala
100 105 110

Ile Arg Glu Val Ile Arg Tyr Val Pro Met Glu Arg Leu Leu Val Glu
115 120 125 

Thr Asp Ser Pro Tyr Leu Ala Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 136 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

Asp Leu Ala Trp Ile Lys Glu Leu Ser Ala His Glu Lys Val Val Ala
1 5 10 15

Ile Gly Glu Met Gly Leu Asp Tyr His Trp Asp Lys Ser Pro Lys Asp
20 25 30

Ile Gln Lys Glu Val Phe Arg Asn Gln Ile Ala Leu Ala Lys Glu Val
35 40 45

Asn Leu Pro Ile Ile Ile His Asn Arg Asp Ala Thr Glu Asp Val Val
50 55 60

Thr Ile Leu Lys Glu Glu Gly Ala Glu Ala Val Gly Gly Ile Met His
65 70 75 80

Cys Phe Thr Gly Ser Ala Glu Val Ala Arg Glu Cys Met Lys Met Asn
85 90 95

Phe Tyr Leu Ser Phe Gly Gly Pro Val Thr Phe Lys Asn Ala Lys Lys
100 105 110

Pro Lys Glu Val Val Lys Glu Ile Pro Asn Asp Arg Leu Leu Ile Glu
115 120 125 

Thr Asp Cys Pro Phe Leu Thr Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Glu Ala Leu Ala Asn Lys Gly Lys Ala Ser Gly Lys Val Val Ala Phe
1 5 10 15

Gly Glu Phe Gly Leu Asp Tyr Asp Arg Leu His Tyr Ala Pro Ala Asp
20 25 30

Val Gln Lys Met Tyr Phe Glu Glu Gln Leu Lys Val Ala Val Arg Val
35 40 45

Gln Leu Pro Leu Phe Leu His Ser Arg Asn Ala Glu Asn Asp Phe Phe
50 55 60

Ala Ile Leu Glu Lys Tyr Leu Pro Glu Leu Pro Lys Lys Gly Val Val
65 70 75 80

His Ser Phe Thr Gly Ser Ile Asp Glu Met Arg Arg Cys Ile Glu His
85 90 95

Gly Leu Tyr Val Gly Val Asn Gly Cys Ser Leu Lys Thr Glu Glu Asn
100 105 110

Leu Glu Val Val Arg Ala Ile Pro Leu Glu Lys Met Leu Leu Glu Thr
115 120 125 

Asp Ala Pro Trp Cys Glu Val 
130 135 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

His Ile Ser Lys Met Glu Gln Phe Phe Val Glu His Glu Arg Asp Ile
1 5 10 15

Ile Cys Val Gly Glu Cys Gly Leu Asp His Thr Ile Ser Gln Phe Lys
20 25 30

Leu Thr Thr Glu Asp Phe Glu Glu Gln Glu Thr Val Phe Lys Trp Gln
35 40 45

Ile Asp Leu Ala Lys His Phe Glu Lys Pro Leu Ile Leu Glu Ile Pro
50 55 60

Asp Ile Ser Arg Asn Val His Ser Arg Ser Ala Ala Arg Arg Thr Ile
65 70 75 80

Glu Ile Leu Leu Glu Cys His Val Ala Pro Asp Gln Val Val Leu His
85 90 95

Ala Phe Asp Gly Thr Pro Gly Asp Leu Lys Leu Gly Leu Glu Ala Gly
100 105 110

Tyr Leu Phe Ser Ile Pro Pro Ser Phe Gly Lys Ser Glu Glu Thr Thr
115 120 125

Gln Leu Ile Glu Ser Ile Pro Leu Ser Gln Leu Leu Leu Glu Thr Asp
130 135 140 

Ser Pro Ala Leu Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 139 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

Gln Glu Arg Asn Leu Leu Gln Ala Leu Arg His Pro Lys Ala Val Ala
1 5 10 15

Phe Gly Glu Met Gly Leu Asp Tyr Ser Tyr Lys Cys Thr Thr Pro Val
20 25 30

Pro Glu Gln His Lys Val Phe Glu Arg Gln Leu Gln Leu Ala Val Ser
35 40 45

Leu Lys Lys Pro Leu Val Ile His Cys Arg Glu Ala Asp Glu Asp Leu
50 55 60

Leu Glu Ile Met Lys Lys Phe Val Pro Pro Asp Tyr Lys Ile His Arg
65 70 75 80

His Cys Phe Thr Gly Ser Tyr Pro Val Ile Glu Pro Leu Leu Lys Tyr
85 90 95

Phe Pro Asn Met Ser Val Gly Phe Thr Ala Val Leu Thr Tyr Ser Ser
100 105 110

Ala Trp Glu Ala Arg Glu Ala Leu Arg Gln Ile Pro Leu Glu Arg Ile
115 120 125

Ile Val Glu Thr Asp Ala Pro Tyr Phe Leu Pro
130 135

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

Ser Arg Arg Ser Phe Leu Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Thr Arg Arg Ser Phe Leu Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Met Lys Thr Lys Ile Pro Asp Ala Val Leu Ala Ala Glu Val Ser Arg
1 5 10 15

Arg Gly Leu Val Lys Thr Thr Ile Ala Phe Phe Leu Ala Met Ala Ser
20 25 30

Ser Ala Leu Thr Leu Pro Phe Ser Arg Ile Ala His Ala Val Asp Ser
35 40 45

Ala Ile
50 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

TTAGTCGGAT TAATCACAAT GTCGATAGCG 30 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATTCTGGCTG GGTGCCACCA GATACCAACG TTGAAGAGTT CGAATTTGCC ATTCGTACGG 60

TCTGTGAACC TATCTTTGAG AAACCGCTGG CCGAAATTTC GTTTGGACAT GTACTGTTAA 120

ATCTGTTTAA TACGGCGCGT CGCTTCAATA TGGAAGTGCA GCCGCAACTG GTGTTACTCC 180

AGAAAACCCT GCTCTACGTC GAAGGGGTAG GACGCCAGCT TTATCCGCAA CTCGATTTAT 240

GGAAAACGGC GAAGCCTTTC CTGGAGTCGT GGATTAAAGA TCAGGTCGGT ATTCCTGCGC 300

TGGTGAGAGC ATTTAAAGAA AAAGCGCCGT TCTGGGTCGA AAAAATGCCA GAACTGCCTG 360

AATTGGTTTA CGACAGTTTG CGCCAGGGCA AGTATTTACA GCACAGTGTT GATAAGATTG 420

CCCGCGAGCT TCAGTCAAAT CATGTACGTC AGGGACAATC GCGTTATTTT CTCGGAATTG 480

GCGCTACGTT AGTATTAAGT GGCACATTCT TGTTGGTCAG CCGACCTGAA TGGGGGCTGA 540

TGCCCGGCTG GTTAATGGCA GGTGGTCTGA TCGCCTGGTT TGTCGGTTGG CGCAAAACAC 600

GCTGATTTTT TCATCGCTCA AGGCGGGCCG TGTAACGTAT AATGCGGCTT TGTTTAATCA 660

TCATCTACCA CAGAGGAACA TGTATGGGTG GTATCAGTAT TTGGCAGTTA TTGATTATTG 720

CCGTCATCGT TGTACTGCTT TTTGGCACCA AAAAGCTCGG CTCCATCGGT TCCGATCTTG 780

GTGCGTCGAT CAAAGGCTTT AAAAAAGCAA TGAGCGATGA TGAACCAAAG CAGGATAAAA 840

CCAGTCAGGA TGCTGATTTT ACTGCGAAAA CTATCGCCGA TAAGCAGGCG GATACGAATC 900

AGGAACAGGC TAAAACAGAA GACGCGAAGC GCCACGATAA AGAGCAGGTG TAATCCGTGT 960

TTGATATCGG TTTTAGCGAA CTGCTATTGG TGTTCATCAT CGGCCTCGTC GTTCTGGGGC 1020

CGCAACGACT GCCTGTGGCG GTAAAAACGG TAGCGGGCTG GATTCGCGCG TTGCGTTCAC 1080

TGGCGACAAC GGTGCAGAAC GAACTGACCC AGGAGTTAAA ACTCCAGGAG TTTCAGGACA 1140

GTCTGAAAAA GGTTGAAAAG GCGAGCCTCA CTAACCTGAC GCCCGAACTG AAAGCGTCGA 1200

TGGATGAACT ACGCCAGGCC GCGGAGTCGA TGAAGCGTTC CTACGTTGCA AACGATCCTG 1260

AAAAGGCGAG CGATGAAGCG CACACCATCC ATAACCCGGT GGTGAAAGAT AATGAAGCTG 1320

CGCATGAGGG CGTAACGCCT GCCGCTGCAC AAACGCAGGC CAGTTCGCCG GAACAGAAGC 1380

CAGAAACCAC GCCAGAGCCG GTGGTAAAAC CTGCTGCGGA CGCTGAACCG AAAACCGCTG 1440

CACCTTCCCC TTCGTCGAGT GATAAACCGT AAACATGTCT GTAGAAGATA CTCAACCGCT 1500

TATCACGCAT CTGATTGAGC TGCGTAAGCG TCTGCTGAAC TGCATTATCG CGGTGATCGT 1560

GATATTCCTG TGTCTGGTCT ATTTCGCCAA TGACATCTAT CACCTGGTAT CCGCGCCATT 1620

GATCAAGCAG TTGCCGCAAG GTTCAACGAT GATCGCCACC GACGTGGCCT CGCCGTTCTT 1680

TACGCCGATC AAGCTGACCT TTATGGTGTC GCTGATTCTG TCAGCGCCGG TGATTCTCTA 1740

TCAGGTGTGG GCATTTATCG CCCCAGCGCT GTATAAGCAT GAACGTCGCC TGGTGGTGCC 1800

GCTGCTGGTT TCCAGCTCTC TGCTGTTTTA TATCGGCATG GCATTCGCCT ACTTTGTGGT 1860

CTTTCCGCTG GCATTTGGCT TCCTTGCCAA TACCGCGCCG GAAGGGGTGC AGGTATCCAC 1920

CGACATCGCC AGCTATTTAA GCTTCGTTAT GGCGCTGTTT ATGGCGTTTG GTGTCTCCTT 1980

TGAAGTGCCG GTAGCAATTG TGCTGCTGTG CTGGATGGGG ATTACCTCGC CAGAAGACTT 2040

ACGCAAAAAA CGCCCGTATG TGCTGGTTGG TGCATTCGTT GTCGGGATGT TGCTGACGCC 2100

GCCGGATGTC TTCTCGCAAA CGCTGTTGGC GATCCCGATG TACTGTCTGT TTGAAATCGG 2160

TGTCTTCTTC TCACGCTTTT ACGTTGGTAA AGGGCGAAAT CGGGAAGAGG AAAACGACGC 2220

TGAAGCAGAA AGCGAAAATiJ^ CTGAAGAATA AATTCAACCG CCCGTCAGGG CGGTTGTCAT 2280

ATGGAGTACA GGATGTTTGA TATCGGCGTT AATTTGACCA GTTCGCAATT TGCGAAAGAC 2340

CGTGATGATG TTGTAGCGTG CGCTTTTGAC GCGGGAGTTA ATGGGCTACT CATCACCGGC 2400

ACTAACCTGC GTGAAAGCCA GCAGGCGCAA AAGCTGGCGC GTCAGTATTC GTCCTGTTGG 2460

TCAhCGGCGG GCGTACATCC TCACGACAGC AGCCAGTGGC AAGCTGCGAC TGAAGAAGCG 2520

ATTATTGAGC TGGCCGCGCA GCCAGAAGTG GTGGCGATTG GTGAATGTGG TCTCGACTTT 2580

AACCGCAACT TTTCGACGCC GGAAGAGCAG GAACGCGCTT TTGTTGCCCA GCTACGCATT 2640

GCCGCAGATT TAAACATGCC GGTATTTATG CACTGTCGCG ATGCCCACGA GCGGTTTATG 2700

ACATTGCTGG AGCCGTGGCT GGATAAACTG CCTGGTGCGG TTCTTCATTG CTTTACCGGC 2760

ACACGCGAAG AGATGCAGGC GTGCGTGGCG CATGGAATTT ATATCGGCAT TACCGGTTGG 2820

GTTTGCGATG AACGACGCGG ACTGGAGCTG CGGGAACTTT TGCCGTTGAT TCCGGCGGAA 2880

AAATTACTGA TCGAAACTGA TGCGCCGTAT CTGCTCCCTC GCGATCTCAC GCCAAAGCCA 2940

TCATCCCGGC GCAACGAGCC AGCCCATCTG CCCCATATTT TGCAACGTAT TGCGCACTGG 3000

CGTGGAGAAG ATGCCGCATG GCTGGCTGCC ACCACGGATG CTAATGCCAA AACACTGTTT 3060

GGGATTGCGT TTTAGAGTTT GCGGAACTCG GTATTCTTCA CACTGTGCTT AATCTCTTTA 3120

(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATGCGGCTTT GTTTAATCAT CATCTACCAC AGAGGAACAT GTATGGGTGG TATCAGTATT 60 

TGGCAGTTAT TGATTATTGC CGTCATCGTT GTACTGCTTT TTGGCACCAA AAAGCTCGGC 120 

TCCATCGGTT CCGATCTTGG TGCGTCGATC AAAGGCTTTA AAAAAGCAAT GAGCGATGAT 180 

GAACCAAAGC AGGATAAAAC CAGTCAGGAT GCTGATTTTA CTGCGAAAAC TATCGCCGAT 240

AAGCAGGCGG ATACGAATCA GGAACAGGCT AAAACAGAAG ACGCGAAGCG CCACGATAAA 300 

GAGCAGGTGT AA 312

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Arg Leu Cys Leu Ile Ile Ile Tyr His Arg Gly Thr Cys Met Gly
1 5 10 15

Gly Ile Ser Ile Trp Gln Leu Leu Ile Ile Ala Val Ile Val Val Leu
20 25 30

Leu Phe Gly Thr Lys Lys Leu Gly Ser Ile Gly Ser Asp Leu Gly Ala
35 40 45

Ser Ile Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro Lys Gln
50 55 60

Asp Lys Thr Ser Gln Asp Ala Asp Phe Thr Ala Lys Thr Ile Ala Asp
65 70 75 80

Lys Gln Ala Asp Thr Asn Gln Glu Gln Ala Lys Thr Glu Asp Ala Lys
85 90 95

Arg His Asp Lys Glu Gln Val
100

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 515 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

TGTTTGATAT CGGTTTTAGC GAACTGCTAT TGGTGTTCAT CATCGGCCTC GTCGTTCTGG 60 

GGCCGCAACG ACTGCCTGTG GCGGTAAAAA CGGTAGCGGG CTGGATTCGC GCGTTGCGTT 120 

CACTGGCGAC AACGGTGCAG AACGAACTGA CCCAGGAGTT AAAACTCCAG GAGTTTCAGG 180 

ACAGTCTGAA AAAGGTTGAA AAGGCGAGCC TCACTAACCT GACGCCCGAA CTGAAAGCGT 240

CGATGGATGA ACTACGCCAG GCCGCGGAGT CGATGAAGCG TTCCTACGTT GCAAACGATC 300 

CTGAAAAGGC GAGCGATGAA GCGCACACCA TCCATAACCC GGTGGTGAAA GATAATGAAG 360 

CTGCGCATGA GGGCGTAACG CCTGCCGCTG CACAAACGCA GGCCAGTTCG CCGGAACAGA 420 

AGCCAGAAAC CACGCCAGAG CCGGTGGTAA AACCTGCTGC GGACGCTGAA CCGAAAACCG 480 

CTGCACCTTC CCCTTCGTCG AGTGATAAAC CGTAA 515

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

Val Phe Asp Ile Gly Phe Ser Glu Leu Leu Leu Val Phe Ile Ile Gly
1 5 10 15

Leu Val Val Leu Gly Pro Gln Arg Leu Pro Val Ala Val Lys Thr Val
20 25 30

Ala Gly Trp Ile Arg Ala Leu Arg Ser Leu Ala Thr Thr Val Gln Asn
35 40 45

Glu Leu Thr Gln Glu Leu Lys Leu Gln Glu Phe Gln Asp Ser Leu Lys
50 55 60

Lys Val Glu Lys Ala Ser Leu Thr Asn Leu Thr Pro Glu Leu Lys Ala
65 70 75 80

Ser Met Asp Glu Leu Arg Gln Ala Ala Glu Ser Met Lys Arg Ser Tyr
85 90 95

Val Ala Asn Asp Pro Glu Lys Ala Ser Asp Glu Ala His Thr Ile His
100 105 110

Asn Pro Val Val Lys Asp Asn Glu Ala Ala His Glu Gly Val Thr Pro
115 120 125

Ala Ala Ala Gln Thr Gln Ala Ser Ser Pro Glu Gln Lys Pro Glu Thr
130 135 140

Thr Pro Glu Pro Val Val Lys Pro Ala Ala Asp Ala Glu Pro Lys Thr 
145 150 155 160 

Ala 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

Met Glu Ala Arg Met Thr Gly Arg Arg Lys Val Thr Arg Arg Asp Ala 
1 5 10 15

Met Ala Asp Ala Ala Arg Ala Val Gly Val Ala Cys Leu Gly Gly Phe 
20 25 30 

Ser Leu Ala Ala Leu Val Arg Thr Ala Ser Pro Val Asp Ala 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Ser Arg Ser Ala Lys Pro Gln Asn Gly Arg Arg Arg Phe Leu Arg
1 5 10 15

Asp Val Val Arg Thr Ala Gly Gly Leu Ala Ala Val Gly Val Ala Leu
20 25 30

Gly Leu Gln Gln Gln Thr Ala Arg Ala
35 40 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Thr Trp Ser Arg Arg Gln Phe Leu Thr Gly Val Gly Val Leu Ala
1 5 10 15

Ala Val Ser Gly Thr Ala Gly Arg Val Val Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Met Asp Arg Arg Arg Phe Leu Thr Leu Leu Gly Ser Ala Gly Leu Thr 
1 5 10 15

Ala Thr Val Ala Thr Ala Gly Thr Ala Lys Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Ser Glu Lys Asp Lys Met Ile Thr Arg Arg Asp Ala Leu Arg Asn
1 5 10 15

Ile Ala Val Val Val Gly Ser Val Ala Thr Thr Thr Met Met Gly Val
20 25 30 

Gly Val Ala Asp Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Gln Ile Val Asn Leu Thr Arg Arg Gly Phe Leu Lys Ala Ala Cys
1 5 10 15

Val Val Thr Gly Gly Ala Leu Ile Ser Ile Arg Met Thr Gly Lys Ala
20 25 30

Val Ala



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:

Met Asn Asn Glu Glu Thr Phe Tyr Gln Ala Met Arg Arg Gln Gly Val
1 5 10 15

Thr Arg Arg Ser Phe Leu Lys Tyr Cys Ser Leu Ala Ala Thr Ser Leu
20 25 30

Gly Leu Gly Ala Gly Met Ala Pro Lys Ile Ala Trp Ala
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Ser Thr Gly Thr Thr Asn Leu Val Arg Thr Leu Asp Ser Met Asp 
1 5 10 15

Phe Leu Lys Met Asp Arg Arg Thr Phe Met Lys Ala Val Ser Ala Leu 
20 25 30 

Gly Ala Thr Ala Phe Leu Gly Thr Tyr Gln Thr Glu Ile Val Asn Ala
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Lys Cys Tyr Ile Gly Arg Gly Lys Asn Gln Val Glu Glu Arg Leu
1 5 10 15

Glu Arg Arg Gly Val Ser Arg Arg Asp Phe Met Lys Phe Cys Thr Ala
20 25 30

Val Ala Val Ala Met Gly Met Gly Pro Ala Phe Ala Pro Lys Val Ala
35 40 45 

Glu Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Asn Arg Arg Asn Phe Ile Lys Ala Ala Ser Cys Gly Ala Leu Leu
1 5 10 15

Thr Gly Ala Leu Pro Ser Val Ser His Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Ser His Ala Asp Glu His Ala Gly Asp His Gly Ala Thr Arg Arg 
1 5 10 15

Asp Phe Leu Tyr Tyr Ala Thr Ala Gly Ala Gly Thr Val Ala Ala Gly 
20 25 30 

Ala Ala Ala Trp Thr Leu Val Asn Gln Met Asn Pro
35 40 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Thr Gln Ile Ser Gly Ser Pro Asp Val Pro Asp Leu Gly Arg Arg
1 5 10 15

Gln Phe Met Asn Leu Leu Thr Phe Gly Thr Ile Thr Gly Val Ala Ala
20 25 30

Gly Ala Leu Tyr Pro Ala Val Lys Tyr Leu Ile Pro
35 40 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Asp Arg Arg Thr Phe Leu Arg Leu Tyr Leu Leu Val Gly Ala Ala 
1 5 10 15

Ile Ala Val Ala Pro Val Ile Lys Pro Ala Leu Asp Tyr Val Gly Tyr
20 25 30 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Met Thr Lys Leu Ser Gly Gln Glu Leu His Ala Glu Leu Ser Arg Arg
1 5 10 15

Ala Phe Leu Ser Tyr Thr Ala Ala Val Gly Ala Leu Gly Leu Cys Gly 
20 25 30 

Thr Ser Leu Leu Ala Gln Gly Ala Arg Ala
35 40 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met Thr Leu Thr Arg Arg Glu Phe Ile Lys His Ser Gly Ile Ala Ala
1 5 10 15

Gly Ala Leu Val Val Thr Ser Ala Ala Pro Leu Pro Ala Trp Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

Met Thr Ile Ser Arg Arg Asp Leu Leu Lys Ala Gln Ala Ala Gly Ile
1 5 10 15

Ala Ala Met Ala Ala Asn Ile Pro Leu Ser Ser Gln Ala Pro Ala
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Ser Glu Ala Leu Ser Gly Arg Gly Asn Asp Arg Arg Lys Phe Leu 
1 5 10 15

Lys Met Ser Ala Leu Ala Gly Val Ala Gly Val Ser Gln Ala Val Gly
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met Lys Thr Lys Ile Pro Asp Ala Val Leu Ala Ala Glu Val Ser Arg
1 5 10 15

Arg Gly Leu Val Lys Thr Thr Ala Ile Gly Gly Leu Ala Met Ala Ser
20 25 30

Ser Ala Leu Thr Leu Pro Phe Ser Arg Ile Ala His Ala
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Met Ser Asn Phe Asn Gln Ile Ser Arg Arg Asp Phe Val Lys Ala Ser
1 5 10 15

Ser Ala Gly Ala Ala Leu Ala Val Ser Asn Leu Thr Leu Pro Phe Asn 
20 25 30 

Val Met Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Met Ser Ile Ser Arg Arg Ser Phe Leu Gln Gly Val Gly Ile Gly Cys
1 5 10 15

Ser Ala Cys Ala Leu Gly Ala Phe Pro Pro Gly Ala Leu Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Lys Thr Val Leu Pro Ser Val Pro Glu Thr Val Arg Leu Ser Arg 
1 5 10 15

Arg Gly Phe Leu Val Gln Ala Gly Thr Ile Thr Cys Ser Val Ala Phe
20 25 30 

Gly Ser Val Pro Ala 
35 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Gly Arg Leu Asn Arg Phe Arg Leu Gly Lys Asp Gly Arg Arg Glu 
1 5 10 15

Gln Ala Ser Leu Ser Arg Arg Gly Phe Leu Val Thr Ser Leu Gly Ala
20 25 30 

Gly Val Met Phe Gly Phe Ala Arg Pro Ser Ser Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Ser Asp Lys Asp Ser Lys Asn Thr Pro Gln Val Pro Glu Lys Leu
1 5 10 15

Gly Leu Ser Arg Arg Gly Phe Leu Gly Ala Ser Ala Val Thr Gly Ala 
20 25 30 

Ala Val Ala Ala Thr Ala Leu Gly Gly Ala Val Met Thr Arg Glu Ser 
35 40 45 

Trp Ala 
50 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Glu Ser Arg Thr Ser Arg Arg Thr Phe Val Lys Gly Leu Ala Ala 
1 5 10 15

Ala Gly Val Leu Gly Gly Leu Gly Leu Trp Arg Ser Pro Ser Trp Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Met Ser Leu Ser Arg Arg Gln Phe Ile Gln Ala Ser Gly Ile Ala Leu
1 5 10 15

Cys Ala Gly Ala Val Pro Leu Lys Ala Ser Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Met Leu Gly Lys Ser Gln Phe Asp Asp Leu Phe Glu Lys Met Ser Arg
1 5 10 15

Lys Val Ala Gly His Thr Ser Arg Arg Gly Phe Ile Gly Arg Val Gly
20 25 30 

Thr Ala Val Ala Gly Val Ala Leu Val Pro Leu Leu Pro Val Asp Arg 
35 40 45 

Arg Gly Arg Val Ser Arg Ala Asn Ala 
50 55 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Met Thr Leu Asn Arg Arg Asp Phe Ile Lys Thr Ser Gly Ala Ala Val
1 5 10 15

Ala Ala Val Gly Ile Leu Gly Phe Pro His Leu Ala Phe Gly
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

Met Thr Asp Ser Arg Ala Asn Arg Ala Asp Ala Thr Arg Gly Val Ala 
1 5 10 15

Ser Val Ser Arg Arg Arg Phe Leu Ala Gly Ala Gly Leu Thr Ala Gly 
20 25 30 

Ala Ile Ala Leu Ser Ser Met Ser Thr Ser Ala Ser Ala
35 40 45 